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bonds in consecutive order and have a sequence identical to the sequence set forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal direction; wherein G represents an amino group or an acetylated amino group; 
wherein X represents a carboxyl group or an amidated carboxyl group; wherein all of ct,Y,D.l,N,Y,Y,T,S,E and P are joined together 
^ by peptide bonds; further provided that at least two tyrosines in the compound are sulfated. 



WO 01/647J0 



PCT/US01/06699 



SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 
5 This application is a continuation-in-part of and claims 
the benefit of U.S. Provisional Application No. 60/267,231, 
filed February 7, 2 001, U.S. Provisional Application No. 
60/205,835, filed May 19, 2000 and U.S. Provisional 
Application No. 60/185,667, filed February 29, 2000, the 
10 contents of which are hereby incorporated by reference into 
this application. 

The invention disclosed herein was made with Government 
support under NIH Grant Nos . R01A143 847 (T.D.) and 
15 R01DK54718 (T.P.S.) from the Department of Health and Human 
Services. Accordingly, the government has certain rights in 
this invention. 

Thx-oughout this application, • various publications are 
20 referenced within parentheses. Disclosures of these 
publications in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 
bibliographic citations for these references may be found 
25 immediately preceding the claims. 

Background of the Invention 

HIV-1 entry into target cells is mediated by the successive 
interaction of the envelope glycoprotein gpl20 with CD4 and 
30 a co-receptor belonging to the seven trans -membrane G 
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protein-coupled chemokine receptor family (Berger et al . 
Ann. Rev. Immunol. 17:657, 1999). Binding of gpl20 to CD4 
exposes or creates a co- receptor binding site on gpl2 0 
(Trkola et al . Nature 384:184, 1996, Wu et al . Nature, 
5 384:179, 1996). CCR5 and CXCR4 are the most physiologically 
relevant and widely used HIV-i co-receptors (Zhang and 
Moore, J. Virol. 73:3443, 1999). CCR5 mediates the entry of 
R5 isolates and CXCR4 mediates the entry of X4 isolates. 
R5X4 isolates are able to exploit both co-receptors (Berger 

:0 et al . Ann. Rev. Immunol. 17:657, 1999). It has been 
demonstrated that specific amino acids including acidic 
residues and tyrosines located within the CCR5 amino- 
terminal domain (Nt, amino acids 2-31) are essential for 
CCR5-mediated fusion and entry of R5 and R5X4 HIV-1 strains 

5 (Dragic et al . J. Virol. 72:279, 1998; Rabut et al . J. 
Virol. 72:3464, 1998; Farzan et al . J. Virol. 72:1160, 
1998; Dorantz et al . J. Virol. 71:6305, 1997). More 
recently, Farzan et al . demonstrated that tyrosine residues 
in the CCR5 Nt are sulfated (Farzan et al . Cell 96:667, 

0 1999) 



Inhibition of cellular sulfation pathways, including 
tyrosine sulfation, by sodium chlorate decreased the 
binding of a gpl20/CD4 complex to CCR5 + cells (Farzan et al . 
Cell 96:667, 1999). A number of prior reports had 
implicated a role for sulfate moieties in HIV-l entry. 
Several sulfated compounds, such as dextran sulfate, can 
inhibit HIV-1 entry by associating with CD4 or gp!20 
(Baeuerle and Huttner J. Cell Biol 105:2655, 1987; Baba et 
al . Proc. Natl. Acad. Sci . USA 85:6132, 1998). Sulfated 
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proteoglycans have been shewn to bind to HIV-1 gpl20 at or 
near its third variable (V3) loop, which also determines 
co-receptor usage (Roderiguez et al . J. Virol. 69:2233, 
1995; Hwang et al . Science 253:71, 1991). It is therefore 
5 conceivable that sulf o-cyrosines in the CCR5 Nt also 
interact with gp!20, increasing its affinity for CCR5 . The 
reduction in gp!20/CD4 binding caused by the pre- treatment 
of target cells with sodium chlorate, however, cannot be 
formally attributed to a reduction in CCR5 tyrosine 
10 sulfation since chlorate can inhibit the sulfation of both 
tyrosines and proteoglycans. 

The region of the CCR5 Nt spanning amino acids 2-18 
contains residues that are critically important for viral 

15 entry (Dragic et al . J. Virol. 72:279, 1998; Rabut et al . 
J. Virol. 72:3464, 1998; Farzan et al . J. Virol. 72:1160, 
1998; Dorantz et al . J. Virol. 71:6305, 1997). We 

previously demonstrated that tyrosines at positions 3, 10 
and 14 were required for optimal co-receptor function, 

20 whereas the TyrlSPhe substituti on had little effect on 
entry (Rabut et al . J. Virol. 72:3464, 1998). Taken 
together, these findings suggested that HIV-1 entry may be 
critically dependent upon sulfation of Tyr-3, -10 and -14, 
but not Tyr-15. We therefore explored the role of sulfo- 

25 tyrosines in positions 3, 10 and 14 by synthesizing 
peptides corresponding to amino acids 2-18 of the CCR5 Nt 
and carrying different tyrosine modifications. We first 
tested the ability of the Nt peptides to inhibit binding of 
gpl20/CD4 complexes and anti-CCR5 MAbs to CCR5 + cells. The 

30 specific association of certain peptides with gpl20/sCD4 
complexes or with anti-CCR5 MAbs was further confirmed by 



surface plasmon resonance (EIAcore) analysis. Inhibition of 
HIV-1 entry by the CCR5 Nt peptides was also tested. Our 
results suggest that amino acids 2-18 of the CCR5 Nt 
compose a gpl20-bindinc site that determines the 
5 specificity of the interaction between CCR5 and gp!20s from 
R5 and R5X4 isolates. Post - translationai sulfation of the 
tyrosine residues in the CCR5 Nt is required for gpl20 
binding and may critically modulate the susceptibility of 
target cells to HIV-1 infection in vivo. 

10 

CCR5 ' s normal physiologic activities involve binding and 
transducing signals mediated by CC-chemokines, including 
RANTES, MlP-la and MIP-1(L, which direct activation and 
trafficking of T cells and other inflammatory cells. As 

15 such, CCR5 plays an important role in mediating the 
inflammatory reaction of diseases such as rheumatoid 
arthritis and multiple sclerosis. The synovial fluid of 
rheumatoid arthritis patients is highly enriched in CCR5- 
expressing T cells (Qin et al . J Clin Invest 101:746, 

20 1998) , and CCR5 is the predominant CC chemokine receptor- 
expressed on T cells in the rheumatoid synovium (Gomez- 
Reino et aJ. Arthritis Rheum 42:989, 1999). Similarly, 
infiltration by CCR5-expressing cells is characteristic of 
plaque lesions in patients with multiple schlerosis 

25 (Balashov et al . Proc Natl Acad Sci USA 96:6873, 1999). 
Such observations provide a rationale for the use of agents 
that block CCR5 for therapy of inflammatory/autoimmune 
diseases, including but not limited to arthritis, multiple 
sclerosis, asthma, psoriasis, autoimmune diabetes, 

30 transplant rejection, and atherosclerosis. 
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Summary of the Invention 

This invention provides a compound comprising the 
structure : 

0aYDINYYTSE3A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; wherein 6 represents an 
amino group or an acetylated amino group; wherein X 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a,Y,D, I,N,Y,Y,T,S,E and (3 are joined 
together by peptide bonds; further provided that at least 
two tyrosines in the compound are sulfated. 

This invention also provides a compound comprising the 
structure : 

GaYDINYYTSEPA 

wherein each T represents a threonine, each S represents a 
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serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
5 with the proviso that if there are more than 2 amino acids, 
they are joined by peptide hcnds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extendino 
therefrom in the amino terminal direction; wherein 3 

10 represents from 0 to 333 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 
peptide bonds in consecutive order and have a sequence 
identical to the sequence sei fox~th in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 

15 in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
group; wherein A represents a carboxyl group or an amidated 
carbcxyl group; wherein all of a,Y,D, I,N,Y,Y,T,S,E and $ 
are joined together by peptide bonds; further provided that 
20 at least two tyrosines in the compound are sulfated. 

This invention provides a composition which comprises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
25 receptor on the surface of a CD4+ cell. 

This invention provides a method of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 
carries a CCR5 receptor on its surface which comprises 
30 contacting the CD4+ cell with ar. amount of one of the 
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compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 
the CD4 + cell. 

This invention provides a method of preventing CD4 + cells 
ci a subject from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
10 effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4 + cells so 
as to thereby prevent the subject's CD4+ cells from 
becoming infected with human immunodeficiency virus. 

15 This invention provides a method of treating a subject 
whose CD4 + cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
amount of one of the compounds described herein effective 
to inhibit binding of human immunodeficiency virus to CCR5 

20 receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
25 which comprises : 

(a) immobilizing one of the compounds described herein on 
a solid support ; 

(b) contacting the immobilized compound from step (a) with 
sufficient detectable CCR5 ligand to saturate all 

30 binding sices for the CCR5 ligand on the immobilized 



compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 
(c) removing any unbound CCR5 ligand; 

(c; contacting the complex from step (b) with the agent; 
and 

(e) detecting whether any CCR5 ligand is displaced from 
the complex, wherein displacement of detectable CCR5 
ligand from the complex indicates that the agent bines 
to the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex; 

(b; removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is bound tc 
the compound in the complex; 

(d) contacting the complex from step (a) with the agent so 
as to displace CCR5 ligand from the complex; 

(e; measuring the amount of CCR5 ligand which is bound tc 
the compound in the presence of the agent ; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in step 
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(c) , wherein a reduced amount measured in step (e) 
indicates that the agent binds to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCRS ligand to the CCRS receptor. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCRS 
receptor which comprises: 

(a) immobilizing one of the compounds described herein on 
10 a solid support; 

(b) contacting the immobilized compound from step (a) with 
the agent and sufficient detectable CCRS ligand to 
saturate all binding sites for the CCRS ligand on the 
compound under conditions permitting binding of the 

15 CCRS ligand to the immobilized compound so as to form 

a complex; 

(c) removing any unbound CCRS ligand; 

(d) measuring the amount of detectable CCRS ligand which 
is bound to the immobilized compound in the complex; 

20 (e) measuring the amount of detectable CCR5 ligand which 
binds to the immobilized compound in the absence of 
the agent; 

(f) comparing the amount of CCRS ligand which is bound to 
the immobilized compound in step (e) with the amount 
25 measured in step (d) , wherein a reduced amount 

measured in step (d) indicates that the agent binds tc 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCRS ligand to the 
CCRS receptor. 

30 
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This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) contacting one of the compounds described herein with 
5 the agent and sufficient detectable CCR5 ligand to 

saturate all binding sites for the CCR5 ligand on the 
compound under conditions permitting binding of the 
CCR5 ligand to the compound so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

10 (c) measuring the amount of detectable CCR5 ligand which 
is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comparing the amount of CCR5 ligand which is bound to 
15 the compound in step (c) with the amount measured in 

step (d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 
as to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

20 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

a; immobilizing one of the compounds described 
25 herein on a solid support; 

b) contacting the immobilized compound from step a) 
with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 
30 c; contacting the immobilized compound from step a' 



with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 
d} comparing the binding signal measured in step b) 
5 with the binding signal measured in step c) , 

wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

10 

This invention provides a method of obtaining a composition 
which comprises : 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 

15 above methods; and 

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 

This invention provides a compound having the structure: 
20 A- (aYDINYYTSEgA) n 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each 1 represents an isoleucine; and each N represents an 

25 asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 

30 and extending therefrom in the amino terminal direction; 
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wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
5 ID NO: 1 beginning with tne P at position 19 and extendino 
therefrom in the carboxy terminal direction; wherein A 
represents a carboxyl group or an amidated carboxyl group ; 
wherein all of a / Y,D,I / N,Y,Y,T,S,E and 3 are joined 
together by peptide bonds , further provided that at least 
10 two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

15 This invention also provides a compound having the 
structure : 

( 6aYDINYYTSEp ) n - A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 

20 represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoieucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 

25 order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amine terminal direction ; 
wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 

30 joined together by peptide bonds in consecutive order and 
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have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group; 
5 wherein all of a , Y , D , I , N , Y , Y , T , S , E and (5 are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to S, A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
10 parentheses to A . 

This invention provides a compound having the structure: 
A - (aYDINYYTSEpX)^ 

wherein each T represents a threonine, each S represents a 

15 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 

20 they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth an SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein (b represents from 0 to 333 amino acids, with the 

25 proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein a 

30 represents a carboxyl group or an ami dated carboxyl group; 
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wherein all of a, Y , D, I , N, y, Y, T , S , E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n is an integer from 1 to 6, 
5 A is a polymer, and the solid line represents up to £ 
linkers which attach the structure in parentheses to A. 



This invention also provides a compound having the 
structure : 

10 ( 6aYDINYYTSE@ ) n ~~ A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 

15 asparagine; wherein o; represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position S 

20 and extending therefrom in the amino terminal direction; 
wherein 3 represents from 0 to' 333 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 

25 ID NO: 1 beginning with the P at position 19 and extendinc 
therefrom in the carbcxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group; 
wherein all of a , Y , D , I , N , Y , Y , T, S , E and p are joined 
together by peptide bonds, further provided that at least 

30 two tyrosines in the compound are sulfated, wherein n is an 




integer from 1 to 8, A is 
represents up to 8 linkers 
parentheses to A. 
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a polymer, and the solid line 
vhich attach the structure in 



-16- 

Brief Description of the Figures 



Fig. 1 Effect of peptides on gp!20 JK _ FL binding to CCR5 . 

L1.2-CCR5"* cell? were incubated with the 
5 biotinylated gpl2 G JR _ FL /CD4 - IcG2 complex in the 

presence of different concentration of peptides 
(a) S-3/10/14, £-10/14, S-10, S-14 or (b) P- 
3/10/14, SR-2/12, SR-10/14, TS-10/14. 

The extent of complex binding in the absence of 

10 peptide was defined as 100% (m.f.i. ~40±5) . 

Einding in the presence of peptide is expressed 
as a percentage of control. When CCR5 -negative 
cells were used, binding of the gp!2 0 JR _ FIj /CD4 - IgG2 
complex was negligible (-10%, m.f.i. -2+1). The 

15 values shown are from a representative 

experiment . 



Fig. 2 Einding of the gpl20/sCD4 complex to sulfated and 

phosphorylated peptides. 

20 Biotinylated peptides were immobilized on a 

sensor chip and their ability to associate with 
gpl20/sCD4 was analyzed by BIAcore . RU values as 
a function of time were measured in the absence 
of peptide (gray dotted lines) , in the presence 

25 of phosphorylated peptide (black dotted lines) or 

m the presence of sulfated peptide (solid black 
lines) . We performed binding analyses with the 
following proteins: (a) gp!2 0 JR . FL /sCD4 , (b) gpl20jp 
FL , (c) sCD4. (d) DV3gpl2G JR _ FL /sCD4, (e) 

30 gpl2G DK123 /sCD4 f (f; gpl20 DK123# (g) gpl20 1AI /sCD4 anc 
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(h) gp!20 LA1 . 



Fig. 3 Effect of peptides on MAb binding to CCR5 . 

L1.2-CCR5 4 cells were incubated with the anti-CCR5 
5 MAbs in the presence of peptides. The extent of 

MAb binding in the absence of peptide was defined 
as 100% (m.f.i. -50-400, depending on the MAb). 
Einding in the presence of peptide is expressed 
as a percentage or control. When CCR5 -negative 
10 cells were used, binding of MAbs was negligible 

(m.f.i. -2+1). Each data point represents the 
mean + s.d. of three replicates. 



Fig. 4 Binding of MAbs to sulfated and phosphorylated 

15 peptides. 

Biotinylated peptides were immobilized on a 
sensor chip and their ability to associate with 
anti-CCR5 MAbs was analyzed by BIAcore . RU values 
as a function of time were measured in the 
20 absence of peptide (gray dotted lines) , in the 

presence of phosphorylated peptide (black dotted 
lines) or in the presence of sulfated peptide 
(solid black lines) . We performed binding 
analyses with (a) PAS, (b) PA10 and (c) 2D7 . 



OS 



Fig. 5 Effect of peptides on viral entry. 

HeLa-CD4"CCR5* cells were infected with Nlluc 4 env" 
pseudotyped with different viral envelopes in the 
presence of peptides. Lucif erase activity 
(r.l.u.) was mesured 48 h post- infection. The 
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extent of entry in the absence of peptide was 
defined as 100% (r.l.u.. ~25,000± 9,000). 
Background r.l.u. values were -7+2. Each data 
point represents the mean + s.d. of three 
5 replicates. 



Fig. 6 CCR5 Nt peptide sequences and labels 

The primary sequence of each peptide is indicated 
in the left column and the corresponding label is 
10 indicated in the right column. Sulfated tyrosine 

residues are designated by black boxes and white 
boxes designate phosphorylated tyrosine residues. 

Fig. 7 Gpl2 0/CD4 complex binding to CCR5 Nt 

15 sulf opeptides 

Peptide 2-18 was bound to streptavidin-coated 
biosensor chips and gpl2 0jr.pl/sCD4 (dotted line) 
or gp!2 0jr.pl/CD4- I gG 2 (solid line) were flowed over 
the sensor chip surface. Resonance units (RU) 

20 were measured as a function of time using a 

Biacore X and reflect complex-peptide binding 
(a) . Sulfopeptide 2-18 (solid symbols) or 
phosphopept ide 2-18 (P) (clear symbols) were 
immobilized on streptavidin-coated ELISA plates 

25 and incubated with gp!20/CD4 - IgG 2 complexes. Gpl20 

proteins were derived from the R5 isolate JR-FL 
(squares), the R5X4 isolate DH123 (circles) and 
the X4 isolate LAI (diamonds) . Complexes -peptide 
binding was detected by an HRP-conjugated goat 

30 anti-human IgG antibody. O.D. at 450 nm was 
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measured after addition of HRP substrate and is 
expressed as a function of CD4-IgG 2 concentration 
(b) . Biotinylated sulfopeptide 2-18 was 
immobilized on streptavidin-coated plates and 
5 incubated with gpl20/CD4 -IgG 5 complex in the 

presence of increasing concentrations of: PA 8 
(solid squares) , TAK-779 (triangles) , Rantes 
(inverted triangles), MIP-1 (diamonds), MIP-1 
(circles) or SDF-1 (clear squares) . Binding of 

10 the complexes to the peptide was detected by 

incubation with HRP-conjugated goat anti-human 
IgG antibody. O.D. at 450 nm was measured after 
addition of HRP substrate and percentage of 
binding was expressed as a function of inhibitor 

15 concentration. 

Fig. 8: Binding of anti-CCR5 KAbs to CCR5 Nt peptides. 

Sulf opeptides (a) or phcsphopept ides (b) were 
immobilized on streptavidin-coated EL.ISA plates 

20 and incubated with anti-CCR5 MAbs PA 8 (solid 

squares) , PA10 (clear circles) , PA11 (solid 
circles) , PA12 (solid diamonds) or PA14 (solid 
triangles) . Binding of the MAbs to the peptides 
was detected by an HRP- con j ugated goat anti-mouse 

25 IgG antibody. O.D. at 450 nm was measured after 

addition of HRP substrate and expressed as a 
function of MAb concentration. 



Fig. 9: Binding of gpl2 0 JR . FL /CD4 -IgG 2 to different CCR5 Nt- 
30 based peptides. 
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Streptavidin plates were coated with 2-18 (black 
squares) , 10-18 (black circles) , 8-15 (black 
diamonds), 6-16 (black stars), 10-15 (white 
square), 10- 18 ( 11A/18A) (black triangles). Plates 
5 were then incubated with gpl2 0 JR . FL /CD4 - IgG 2 

complex. Binding of the complex to the peptide 
was detected by an HRP-conjugated goat anti-human 
IgG antibody. O.D. at 450 nm was measured after 
addition of HRP substrate and expressed as a 
10 function of CD4-IgG 2 concentration (nM) . 



Fig. 10: Inhibition of gpl2 0/CD4 - IgG 2 complex binding to 
sulf o-peptides by anti-gpl20 MAbs 

Biotinylated sulfopeptide 2-18 was bound to 
15 streptavidin-coated biosensor chips and solutions 

of either gp!2 0 JR _ FIj /CD4 - IgG 2 complex (black bars) 
or gpl2 0 DH123 /CD4-IgG 2 complex (white bars) were 
flowed over the surface of the chip in the 
presence of different anti-gpl20 MAbs. The names 
20 of the MAbs and the location of their epitopes 

are indicated along the x-axis. Resonance units 
(RU) were measured as a function of time using a 
Biacore X and reflect complex-peptide binding in 
the presence of the MAbs. Gpl20/CD4-IgG 2 binding 
25 was calculated using the formula: (RU in the 

presence of MAbs) / (RU in the absence of MAbs) 
xl00%. The values shown are from a sample 
experiment . 
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Fig. 11: Binding of gpl20 mutants to sul f o-peptide and 



10 



15 



20 



25 

Fig. 12: 
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wild type CCR5 . 

Sulf o-peptide 2-18 was immobilized on 

streptavidin-coated plates and incubated with a 
mixture of gpl20-containing supernatants and CD4- 
IgG 2 . Peptide-complex binding was detected by an 
HRP-conjugated goat anti-human IgG antibody. O.D. 
at 450 nm was measured after addition of HRP 
substrate and normalized for binding of the gpl20 
mutants to CD4-IgG 2 . The doted line represents the 
normalized value for the binding of the wild type 
gpl2 0 to the peptide. The mutated amino acids and 
their locations in gpl20 are indicated along the 
x-axis (a) . L12-CCR5* cells were incubated with a 
mixture of gpl20-containing supernatants and CD4 - 
IgG 2 . Binding of the complex was detected by FACS 
analysis after addition of streptavidin- PE . 
Percentage of gpl20/CD4-lgG 2 binding to CCR5 was 
normalized for gpl2 0 binding to CD4-IgG 2 . The 
doted line represents the normalized value for 
the binding of wild-type gpl20 to the L12-CCR5+ 
cells. The mutated amine acids and their 
locations in gpl20 are indicated along the x-axis 
(b) . 



Amino acid sequences of CCR5 Nt-based peptides. 

The peptides are named according to the positions 
of their first and last residues in the full- 
length sequence of CCR5 . They contain either 
sulf otyrosines (black bcxes) or phosphotyrosines 
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(white boxes) n positions 10 and 14. Residues 
Asp-11 and Glu-18 in peptide 10-18 (11A/18A) are 
substituted for alanines. All peptides carry a 
carboxy terminal GAG spacer followed by a 
5 biotinylated lysine. 



Fig- 13: Amino acid conservation among R5 isolates. 

Envelope sequences from 25 R5 strains described 
in the HIV Database and retrieved from the 

10 National Center for Biotechnology Information 

GenBank were aligned and percentage of 
conservation for the indicated residues was 
calculated and combined with results from Hung et 
al. , 1999 (REF) . Alanine mutants showing more 

15 than 50 % decrease in sulfopeptide 2-18 binding 

compared to the wild type are highlighted in 
gray. 
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Detailed Description of the Invention 

The plasmids CD4 -IgG2-HC-pRcCMV and CD4 -kLC-pRcCMV were 
deposited pursuant to, and in satisfaction of, the 
requirements of the Budapest Treaty on the International 
5 Recognition of the Deposit of Microorganisms (the "Budapest 
Treaty") for the Purposes of Patent Procedure with the 
American Type Culture Collection (ATCC) , 10801 University 
Boulevard, Manassas, Virginia 20110-2209 under ATCC 
Accession Nos. 75193 and 75194, respectively. 

10 

The plasmids designated PPI4-tPA-gpl20 JRTFL and PPI4-tPA- 
gpl20 LAI were deposited pursuant to, and in satisfaction of, 
the requirements of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms 

15 for the Purposes of Patent Procedure with the American Type 
Culture Collection (ATCC) , 10801 University Boulevard, 
Manassas, Virginia 20110-2209 under ATCC Accession Nos. 
75431 and 75432, respectively. These plasmids were 

deposited with ATCC on March 12, 1993. These eukaryotic 

20 shuttle vectors contain the cytomegalovirus major 
immediate-early (CMV MIE) promoter/enhancer linked to the 
full-length HIV-1 envelope gene whose signal sequence was 
replaced with that derived from tissue plasminogen 
activator. In the vector, a stop codon has been placed at 

25 the gpl20 C-terminus to prevent translation of gp41 
sequences, which are present in the vector. The vector 
also contains an ampicillin resistance gene, an SV40 origin 
of replication and a DKFR gene whose transcription is 
driven by the 3-globin promoter. 



30 



10 
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The monoclonal antibodies PA8 , PA10, PA11, PA12, and PA14 
were deposited pursuant to and in satisfaction of, the 
requirements of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure with the American Type Culture 
Collection (ATCC) , 10801 University Boulevard, Manassas, 
Virginia 20110-2209 on December 2, 1998 under the following 
Accession Nos. : ATCC Accession No. KB-12605 (PA8 ) , ATCC 
Accession No . HB- 12607 (PA10) , ATCC Accession No. HB-12608 
(PA11), ATCC Accession No. HB-12609 (PA12), and ATCC 
Accession No. HB-12610 (PA14) . 



As used herein, the following standard abbreviations are 
used throughout the specification to indicate specific 
15 amino acids: 



20 



25 



A=ala= 
N=asn= 
C=cys = 
E=glu= 
H=his= 
L=leu= 
M=met= 
P=pro= 
T=thr= 
Y=tyr= 
B=asx= 
Z=glx= 



alanine 

asparagine 

^cysteine 

glutamic acid 

histidine 

leucine 

methionine 

proline 

threonine 

tyrosine 



R=arg=arginine 
D=asp=aspartic acid 
Q=gln=glut amine 
G=gly=glycine 
I=ile=isoleucine 
K=lys=lysine 
F=phe=phenyl alanine 
S=ser=serine 
W=trp= tryptophan 
V=val =val ine 



asparagine or aspartic acid 
glutamine or glutamic acid 



30 



As used herein, the following standard abbreviations are 
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used throughout the specification to indicate specific 

nucleotides: C=cytosine; A=adenosine; T=thymidine; 
G= guanosine; and U=uracil . 

5 This invention provides a compound comprising the 
structure : 

6aYDINYYTSE3A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 

10 represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein cc represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order and 

15 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein p 
represents from 0 to 13 amino acids, with the proviso that 
if there are more than 2 amino acids, they are joined by 

20 peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 

wherein 6 represents an amino group or an acetylated amino 
25 group ; wherein A represents a carboxyl group or an amidated 
carboxyl group; wherein all of a , Y , D, I , N, Y, Y , T, S , E and (5 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

30 In one embodiment of the above compound, the compound is 
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peptide which comprises consecutive amino acids having the 
sequence YDINYYTSE. 

In one embodiment of the above compound, the tyrosines at 
5 positions 1 and 5 of the sequence YDINYYTSE are sulfated. 

In one embodiment of the above compound, a represents less 
than 9 amino acids. In another embodiment of the above 
compound, a represents less than 8 amino acids. In another 

10 embodiment of the above compound, a represents less than 7 
amino acids. In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a represents less than 5 amino 
acids. In another embodiment of the above compound, a 

15 represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 amino acids. In another embodiment 
of the above compound, a represents less than 1 amino acid. 

20 

In one embodiment of the above compound, 3 represents less 
than 17 amino acids. In one embodiment of the above 
compound, 3 represents less than 16 amino acids. In one 
embodiment of the above compound, 3 represents less than 15 

25 amino acids. In one embodiment of the above compound, 3 
represents less than 14 amino acids. In one embodiment of 
the above compound, 3 represents less than 13 amino acids. 
In one embodiment of the above compound, 3 represents less 
than 12 amino acids . In one embodiment of the above 

30 compound, 3 represents less than 11 amino acids. In one 



-27- 

embodiment of the above compound, 3 represents less than 10 
amino acids. In one embodiment of the above compound, (3 
represents less than 9 amino acids. In one embodiment of 
the above compound, 3 represents less than 8 amino acids. 
5 In one embodiment of the above compound, 3 represents less 
than 7 amino acids. In one embodiment of the above 
compound, 3 represents less than 6 amino acids. In one 
embodiment of the above compound, [3 represents less than 5 
amino acids. In one embodiment of the above compound, 3 
10 represents less than 4 amino acids. In one embodiment of 
the above compound, [3 represents less than 3 amino acids. 
In one embodiment of the above compound, (3 represents less 
than 2 amino acids. In one embodiment of the above 
compound, (3 represents less than 1 amino acid. 

15 

This invention also provides a compound comprising the 
structure : 

9 a YD I N Y YT S E 3 A 

wherein each T represents a threonine, each S represents a 
20 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
25 they are joined by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; wherein 3 
represents from 0 to 333 amino acids, with the proviso that 
30 if there are more than 2 amino acids, they are joined by 
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peptide bonds in consecutive order and have a sequence 
identical to the sequence set forth in SEQ ID NO: 1 
beginning with the P at position 19 and extending therefrom 
in the carboxy terminal direction; 
5 wherein 6 represents an amino group or an acetylated amino 
group; wherein X represents a carboxy 1 group or an amidated 
carboxy 1 group; wherein all of of , Y , D, I , N, Y , Y, T , S , E and P 
are joined together by peptide bonds; further provided that 
at least two tyrosines in the compound are sulfated. 

10 

In the compounds described herein and as exemplified above, 
the P in each compound may alternatively represent from 0 
to 333 amino acids. 

15 In one embodiment of the compounds described herein, P 
represents less than 300 amino acids. In another embodiment 
of the above compound, p represents less than 250 amino 
acids. In another embodiment of the above compound, P 
represents less than 200 amino acids. In another embodiment 

20 of the above compound, P represents less than 150 amino 
acids. In another embodiment of the above compound, P 
represents less than 100 amino acids. In another embodiment 
of the above compound, P represents less than 75 amino 
acids. In another embodiment of the above compound, P 

25 represents less than 50 amino acids. In another embodiment 
of the above compound, P represents less than 4 0 amino 
acids. In another embodiment of the above compound, P 
represents less than 35 amino acids. In another embodiment 
of the above' compound, P represents less than 30 amino 

30 acids. In another embodiment of the above compound, P 



represents less than 25 amino acids. In another embodiment 
of the above compound, 3 represents less than 20 amino 
acids. In another embodiment of the above compound, 3 
represents less than 19 amino acids. In another embodiment 
of the above compound, 3 represents less than 18 amino 
acids. In another embodiment of the above compound, (3 
represents less than 17 amino acids. In another embodiment 
of the above compound, 3 represents less than 16 amino 
acids. In another embodiment of the above compound, 3 
represents less than 15 amino acids. In another embodiment 
of the above compound, 3 represents less than 14 amino 
acids. In another embodiment of the above compound, 3 
represents less than 13 amino acids. In another embodiment 
of the above compound, 3 represents less than 12 amino 
acids. In another embodiment of the above compound, 3 
represents less than 11 amino acids. 

In one embodiment of the above compound, a represents less 
than 9 amino acids. In another embodiment of the above 
compound, a represents less than 8 amino acids. In another 
embodiment of the above compound, a represents less than 7 
amino acids . In another embodiment of the above compound, a 
represents less than 6 amino acids. In another embodiment 
of the above compound, a represents less than 5 amino 
acids. In another embodiment of the above compound, a 
represents less than 4 amino acids. In another embodiment 
of the above compound, a represents less than 3 amino 
acids. In another embodiment of the above compound, a 
represents less than 2 amino acids. In another embodiment 
of the above compound, a represents less than 1 amino acid. 
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The CCR5 amino acid sequence is the following and is set 
forth in SEQ ID N0:1: 

1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSIiV 
41 FIFGFVGNMLVILILINCKRLKSMTDIYLIiNLAISDLFFL 

5 81 LTVPFWAKYAAAQWDFGNTMCOLLTGLYFIGFFSGIFFII 
121 LLTI DRYLAWHAVFALKARTVTFGWTS VI TWWAVFAS 
161 LPGI I FTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKI VI 

2 01 LfGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI 

2 41 MI VYFL.FWAPYNI VLLLNTFQEFFGLNNCSSSNRLDQAMQ 
10 2 81 VTETLGMTHCCINPI I YAFVGEKFRNYLLVFFQKHI AKRF 

3 21 CKCCS I FQQEAPERASSVYTRSTGEQEI SVGLi 3 52 

The CCR5 nucleotide sequence is the following and is set 
forth in SEQ ID NO : 2 : 

15 1 GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 

61 TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 
121 TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 
181 AAATACATTC TAGGACTTTA TAAAAG AT C A CTTTTTATTT ATGCACAGGG TGGAACAAGA 
241 TGGATTATCA AGTGTCAAGT CCAATCTATG ACATCAATTA TTATACATCG GAGCCCTGCC 

20 301 AAAAAATCAA TGTGAAGCAA ATCGCAGCCC GCCTCCTGCC TCCGCTCTAC TCACTGGTGT 
361 TCATCTTTGG TTTTGTGGGC AACATG CTGG TCATCCTCAT CCTGATAAAC TGCAAAAGGC 
421 TGAAGAGCAT GACTGACATC TACCTGCTCA ACCTGGCCAT CTCTGACCTG TTTTTCCTTC 

4 81 TTACTGTCCC CTTCTGGGCT CACTATG CTG CCGCCCAGTG GGACTTTGGA AATACAATGT 
541 GTCAACTCTT GACAGGGCTC TATTTTATAG GCTTCTTCTC TGGAATCTTC TTCATCATCC 

25 6 01 TCCTGACAAT CGATAGGTAC CTGGCTGTCG TCCATGCTGT GTTTGCTTTA AAAGCCAGGA 
661 CGGTCACCTT TGGGGTGGTG ACAAGTGTGA TCACTTGGGT GGTGGCTGTG TTTGCGTCTC 
721 TCCCAGGAAT CATCTTTACC AGATCTCAAA AAGAAGGTCT TCATTACACC TGC AG CTCTC 
7 81 ATTTTCCATA CAGTCAGTAT CAATTCTGGA AGAATTTCCA GACATTAAAG ATAGTCATCT 
841 TGGGGCTGGT CCTGCCGCTG CTTGTCATGG TCATCTG CTA CTCGGGAATC CTAAAAACTC 

30 901 TGCTTCGGTG TCGAAATGAG AAGAAGAGGC ACAGGGCTGT GAG G CTTATC TTCACCATCA 
961 TGATTGTTTA TTTTCTCTTC TGGGCTCCCT ACAACATTGT CCTTCTCCTG AACACCTTCC 
1021 AGGAATTCTT TGGCCTGAAT AATTGCAGTA GCTCTAACAG GTTGGACCAA GCTATGCAGG 
1081 TG A C AG AG AC TCTTGGGATG ACGCACTGCT GCATCAACCC CATCATCTAT GCCTTTGTCG 
1141 GGGAGAAGTT CAGAAACTAC CTCTTAGTCT TCTTCCAAAA GCACATTGCC AAACGCTTCT 
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12 01 GCAAATGCTG TTCTATTTTC CAGCAAGAGG CTCCCGAGCG AGCAAGCTCA GTTTACACCC 

12 61 GATCCACTGG GGAGCAGGAA ATATCTGTGG GCTTGTGACA CGGACTCAAG TGGGCTGGTG 

13 21 ACCCAGTCAG AGTTGTGCAC ATGGCTTAGT TTTCATACAC AGCCTGGGCT GGGGGT 

5 The YDINYYTSE sequence corresponds to amino acid residues 
10-18 of the CCR5 sequence set forth above. 

As used herein, "CCRB" is a chemokine receptor which binds 
members of the CC group of chemokines and whose amino acid 

10 sequence comprises that provided in Genbank Accession 
Number 1705896 and related polymorphic variants. The 
nucleotide sequence comprises that provided in Genbank 
Accession Number X91492. In one embodiment, the above 
compound may correspond to the extracellular portion of 

15 CCR5 . The first 31 amino acids of CCR5 correspond to the 
extracellular portion of CCR5 . Accordingly, the 
extracellular portion extends from the methionine at 
position number 1 to the arginine at position number 31 of 
SEQ ID N0:1. In another embodiment, the above compound may 

20 correspond to the amino- terminal portion of CCR5 . As used 
herein, "N- terminus" or amino- terminus means the sequence of 
amino acids spanning the initiating methionine and the 
first transmembrane region. 

25 As used herein, "H 2 N" refers to the N-terminus or amino- 
terminus. As used herein, "COOH" refers to the C-terminus 
or carboxy- terminus . 

Various tyrosines of the compounds described herein may be 
30 sulfated. These include but are not limited to the 
tyrosines at positions 3, 10 and 14 of amino acid sequence 
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set forth in SEQ ID NO:l. Accordingly, in one embodiment, 
the tyrosines at positions 10 and 14 are sulfated. In 
another embodiment, the tyrosines at positions 3 and 14 are 
sulfated. In another embodiment, the tyrosines at positions 
5 3 and 10 are sulfated. In another embodiment, the tyrosines 
at positions 3, 10 and 14 are sulfated. Other tyrosines in 
the sequence set forth in SEQ ID NO : 1 may also be sulfated. 

This invention provides a composition comprising one of the 
10 compounds described, herein and a detectable marker attached 
thereto. In one embodiment of the composition, the 
detectable marker is biotin. In one embodiment of the 
composition, the detectable marker is attached at the C- 
terminus of the compound. 

15 

The compounds of the subject invention may also be isolated 
or purified. In one embodiment the compound is labeled with 
a detectable marker. As used herein, chemical "labels" 
include radioactive isotopes, fluorescent groups and 
20 affinity moieties such as biotin that facilitate detection 
of the labeled peptide. Other chemical labels are well- 
known to those skilled in the art. Methods for attaching 
chemical labels to peptides are well-known to the skilled 
artisan . 

25 

As used herein, "peptide" and "polypeptide" are used to 
denote two or more amino acids linked by a peptidic bond 
between the a-carboxyl group of one amino acid and the a- 
amino group of the next amino acid. Peptides may be 
30 produced by solid-phase synthetic methods that are well- 
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known to those skilled in the art. In addition to the above 
set of twenty amino acids that are used for protein 
synthesis in vivo, peptides may contain additional amino 
acids, including but not limited to hydroxyproline , 
5 sarcosine, and ycarboxyglutamate . The peptides may contain 
modifying groups including but not limited to sulfate and 
phosphate moieties. Peptides can be comprised of L- or D- 
amino acids, which are mirror- image forms with differing 
optical properties. Peptides containing D-amino acids have 
10 the advantage of being less susceptible to proteolysis in 
vivo . 

Peptides may by synthesized in monomeric linear form, 
cyclized form or as oligomers such as branched multiple 

15 antigen peptide (MAP) dendrimers (Tarn et al . Biopolymers 
51:311, 1999). Nonlinear peptides may have increased 
binding affinity by virtue of their restricted 
conformations and/or oligomeric nature. Peptides may also 
be produced using recombinant methods as either isolated 

20 peptides or as a portion of a larger fusion protein that 
contains additional amino acid sequences . 

Peptides may be chemically conjugated to proteins by a 
variety of well-known methods. Such peptide-protein 

25 conjugates can be formulated with a suitable adjuvant and 
administered parenterally for the purposes of generating 
polyclonal and monoclonal antibodies to the peptides of 
interest. Alternatively, unconjugated peptides can be 

formulated with adjuvant and administered to laboratory 

30 animals for the purposes of generating antibodies. Methods 
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for generating and isolating such antibodies are well-known 
to those skilled in the art. 

This invention provides derivatives of the above compound. 
As used herein, a "derivative" peptide is one whose amino 
acid sequence is nonidentical to the reference peptide but 
which possesses functionally similar binding properties. 
Derivative peptides may also contain N-terminal, C-terminal 
and/or internal insertions, deletions, or substitutions of 
amino acids, with the proviso that such insertions, 
deletions and substitutions do not abrogate the binding 
properties of the peptide. Derivative peptides include 
peptides modified with chemical labels to facilitate 
detection. Derivative peptides include branched and 
cyclized peptides . 

As used herein, "sulf opept ides" are peptides that contain 
sulfate moieties attached to one or more amino acids, such 
as tyrosine. In "sulf o- tyrosines" , a sulfate group replaces 
the para-hydroxyl group located on tyrosine side- chain. 

As used herein, "phosphopept ides" are peptides that contain 
phosphate moieties attached to one or more amino acids, 
such a tyrosine. In "phospho- tyrosines", a phosphate group 
replaces the para- hydroxy 1 group located on tyrosine side- 
chain. 

The peptides of the subject invention may be sulfated when 
synthesized or they may be subsequently sulfated. For 
example, means of sulfating the peptides include chemical 
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sulfation or enzymatic sulfation. One skilled in the art 
would know how to employ these and other techniques to 
sulfate the compound. 

5 This invention provides a composition which comprises a 
carrier and an amount of one of the compounds described 
herein effective to inhibit binding of HIV-1 to a CCR5 
receptor on the surface of a CD4+ cell. 

10 The carriers include but are not limited to an aerosol, 
intravenous, oral or topical carrier. Accordingly. The 
invention provides the above composition adapted for 
aerosol, intravenous, oral or topical application. 

15 This invention provides the above compositions and a 
pharmaceutically acceptable carrier. Pharmaceutically 
acceptable carriers are well known to those skilled in the 
art. Such pharmaceutically acceptable carriers may include 
but are not limited to aqueous or non-aqueous solutions, 

20 suspensions, and emulsions. Examples of non-aqueous 

solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate . Aqueous carriers include 
water, alcoholic/aqueous solutions, emulsions or 

25 suspensions, saline and buffered media. Parenteral 
vehicles include sodium chloride solution, Ringer's 
dextrose, dextrose and sodium chloride, lactated Ringer's 
or fixed oils. Intravenous vehicles include fluid and 
nutrient replenishers , electrolyte replenishers such as 

30 those based on Ringer's dextrose, and the like. 




-36- 

Preservatives and other additives may also be present, such 
as, for example, antimicrobials, antioxidants, chelating 
agents, inert gases and the like. 

5 As used herein, "composition" means a mixture. The 
compositions include but are not limited to those suitable 
for oral, rectal, intravaginal , topical, nasal, opthalmic, 
or parenteral administration to a subject. As used herein, 
"parenteral" includes but is not limited to subcutaneous, 
10 intravenous, intramuscular, or intrasternal injections or 
infusion techniques. ; 

! 

i 

As used herein, "administering" may be effected or performed 
using any of the methods known to one skilled in the art . 

15 The methods may comprise intravenous, intramuscular or 
subcutaneous means. As used herein, "effective dose" means 
an amount in sufficient quantities to either treat the 
subject or prevent the subject from becoming infected with 
HIV-1. A person of ordinary skill in the art can perform 

20 simple titration experiments to determine what amount is 
required to treat the subject. 

This invention provides a method of inhibiting human 
immunodeficiency virus infection of a CD4+ cell which also 

25 carries a CCR5 receptor on its surface which comprises 
contacting the CD4+ cell with an amount of one of the 
compounds described herein effective to inhibit binding of 
human immunodeficiency virus to the CCR5 receptor so as to 
thereby inhibit human immunodeficiency virus infection of 

30 the CD4+ cell. As used herein, "inhibits" means that the 
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amount is reduced. In a preferred embodiment, inhibits 
means that the amount is reduced 100%, 

In one embodiment of this method, the CD4+ cell is present 
5 in a subject and the contacting is effected by 
administering the compound to the subject. 

This invention provides a method of preventing CD4 + cells 
of a subject ■ from becoming infected with human 
immunodeficiency virus which comprises administering to the 
subject an amount of one of the compounds described herein 
effective to inhibit binding of human immunodeficiency 
virus to CCR5 receptors on the surface of the CD4 + cells so 
as to thereby prevent the subject's CD4 + cells from 
becoming infected with human immunodeficiency virus. 

This invention provides a method of treating a subject 
whose CD4+ cells are infected with human immunodeficiency 
virus which comprises administering to the subject an 
20 amount of one of the compounds described herein effective 
to inhibit binding of human immunodeficiency virus to CCR5 
receptors on the surface of the subject's CD4+ cells so as 
to thereby treat the subject. 

25 As used herein, human immunodeficiency virus includes but 
is not limited to HIV-1, which is the human 
immunodeficiency virus type-1. HIV-1 includes but is not 
limited to extracellular virus particles and the forms of 
HIV-1 found in HIV-1 infected cells. 
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As used herein, "HIV-1 infection" means the introduction of 
HIV-1 genetic information into a target cell, such as by 
fusion of the target cell membrane with HIV-1 or an HIV-1 
envelope glycoprotein* cell. The target cell may be a bodily 
5 cell of a subject. In the preferred embodiment, the target 
cell is a bodily cell from a human subject. 

As used herein, "inhibiting HIV-1 infection" means the 
reduction of the amount of HIV-1 genetic information 
10 introduced into a target cell population as compared to the 
amount that would be introduced without the composition. 

i 

In the above methods, the compound may be administered by 
various routes including but not limited to aerosol, 

15 intravenous, oral or topical route. The administration may 
comprise intralesional , intraperitoneal, intramuscular or 
intravenous inj ection; infusion; liposome-mediated 

delivery; topical, intrathecal, gingival pocket, per 
rectum, intrabronchial , nasal, oral, ocular or otic 

20 delivery. In a further embodiment, the administration 
includes intrabronchial administration, anal, intrathecal 
administration or transdermal delivery. In another 
embodiment, the compound is administered hourly, daily, 
weekly, monthly or annually. In another embodiment, the 

25 effective amount of the compound comprises from about 
0.000001 mg/kg body weight to about 100 mg/kg body weight. 

The administration may be constant for a certain period of 
time or periodic and at specific intervals. The compound 
30 may be delivered hourly, daily, weekly, monthly, yearly 
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(e.g. in a time release form) or as a one time delivery. 
The delivery may be continuous delivery for a period of 
time, e.g. intravenous delivery. 

5 The carrier may be a diluent, an aerosol, a topical 
carrier, an aqeuous solution, a nonaqueous solution or a 
solid carrier. 

The effective amount of the compound may comprise from 

10 about 0.000001 mg/kg body weight to about 100 mg/kg body 
weight. In one embodiment, the effective amount may 

comprise from about 0.001 mg/kg body weight to about SO 
mg/kg body weight. In another embodiment, the effective 
amount may range from about 0.01 mg/kg body weight to about 

15 10 mg/kg body weight. The actual effective amount will be 
based upon the size of the compound, the biodegradabili ty 
of the compound, the bioactivity of the compound and the 
bioavailability of the compound. If the compound does not 
degrade quickly, is bioavailable and highly active, a 

20 smaller amount will be required to be effective. The 
effective amount will be known to one of skill in the art; 
it will also be dependent upon the form of the compound, 
the size of the compound and the bioactivity of the 
compound. One of skill in the art could routinely perform 

25 empirical activity tests for a compound to determine the 
bioactivity in bioassays and thus determine the effective 
amount . 

The compound of the present invention may be delivered 
30 locally via a capsule which allows sustained release of the 
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agent or the peptide over a period of time. Controlled or 
sustained release compositions include formulation in 
lipophilic depots (e.g., fatty acids, waxes, oils). Also 
comprehended by the invention are particulate compositions 
5 coated with polymers (e.g., poloxamers or poloxamines) and 
the agent coupled to antibodies directed against tissue- 
specific receptors, ligands or antigens or coupled to 
ligands of tissue- specif ic receptors. Other embodiments of 
the compositions of the invention incorporate particulate 
10 forms protective coatings, protease inhibitors or 
permeation enhancers for various routes of administration, 
including parenteral, pulmonary, nasal and oral. 

In one embodiment of the above methods, the subject is 
15 infected with HIV-1 prior to administering the compound to 
the subject. In one embodiment of the above methods, the 
subject is not infected with HIV-1 prior to administering 
the compound to the subject. In one embodiment of the above 
methods, the subject is not infected with, but has been 
20 exposed to, human immunodeficiency virus. 

In one embodiment of the above methods, the effective 
amount of the compound comprises from about 1.0 ng/kg to 
about 100 mg/kg body weight of the subject. In another 

25 embodiment of the above methods, the effective amount of 
the compound comprises from about 100 ng/kg to about 50 
mg/kg body weight of the subject. In another embodiment of 
the above methods, the effective amount of the compound 
comprises from about 1 ^9/^9 to about 10 mg/kg body weight 

30 of the subject. In another embodiment of the above methods, 
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the effective amount of the compound comprises from about 
100 ptg/kg to about 1 mg/kg body weight of the subject. 

The dose of the composition of the invention will vary 
5 depending on the subject and upon the particular route of 
administration used. Dosages can range from 0.1 to 100,000 
^tg/kg. Based upon the composition, the dose can be 
delivered continuously, such as by continuous pump, or at 
periodic intervals. For example, on one or more separate 
10 occasions. Desired time intervals of multiple doses of a 
particular composition can be determined without undue 
experimentation by one skilled in the art. 

As used herein, "effective dose" means an amount in 
sufficient quantities to either treat the subject or 
prevent the subject from becoming infected with HIV-1. A 
person of ordinary skill in the art can perform simple 
titration experiments to determine what amount is required 
to treat the subject. 

In one embodiment of the above method, the subject is a 
human being. As used herein, "subject" means any animal or 
artificially modified animal capable of becoming HIV- 
infected. Artificially modified animals include, but are 
not limited to, SCID mice with human immune systems. The 
subjects include but are not limited to mice, rats, dogs, 
guinea pigs, ferrets, rabbits, and primates. In the 
preferred embodiment, the subject is a human being. 

30 -This invention provides a vaccine which comprises the 
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compound described herein. Vaccines comprising the 
sulf opeptides and a suitable adjuvant could be administered 
to a subject for the purposes of generating antibodies or 
other immune responses that are of therapeutic or 
5 prophylactic value. For example, the vaccines could be 
administered for the purpose of generating in the subject 
antibodies that bind CCR5 and inhibit its ability to 
mediate HIV entry and infection, thereby protecting the 
subject from HIV infection or disease progression. The 
10 vaccines may also comprise a suitable adjuvant. The vaccine 
may also comprises a suitable carrier. 

The subject invention has various applications which 
includes HIV treatment such as treating a subject who has 

15 become afflicted with HIV. As used herein, "afflicted with 
HIV-l" means that the subject has at least one cell which 
has been infected by HIV-l. As used herein, "treating" 
means either slowing, stopping or reversing the progression 
of an HIV-l disorder. In the preferred embodiment, 

20 "treating" means reversing the progression to the point of 
eliminating the disorder. As used herein, "treating" also 
means the reduction of the number of viral infections, 
reduction of the number of infectious viral particles, 
reduction of the number of virally infected cells, or the 

25 amelioration of symptoms associated with HIV-l. Another 
application of the subject invention is to prevent a 
subject from contracting HIV. As used herein, "contracting 
HIV-l" means becoming infected with HIV-l, whose genetic 
information replicates in and/or incorporates into the host 

30 pells. Another application of the subject invention is to 
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treat a subject who has become infected with HIV-1. As used 
herein, n HIV-l infection" means the introduction of HIV-1 
genetic information into a target cell, such as by fusion 
of the target cell membrane with HIV-1 or an HIV-1 envelope 
glycoprotein- 4 " cell. The target cell may be a bodily cell of 
a subject. In the preferred embodiment, the target cell is 
a bodily cell from a human subject. Another application of 
the subject invention is to inhibit HIV-1 infection. As 
used herein, "inhibiting HIV-1 infection" means reducing 
the amount of HIV-1 genetic information introduced into a 
target cell population as compared to the amount that would 
be introduced without said composition. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) immobilizing one of the compounds described herein on 
a solid support; 

(b) contacting the immobilized compound from step (a) with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the immobilized 
compound under conditions permitting binding of the 
CCR5 ligand to the immobilized compound so as to form 
a complex; 

(c) removing any unbound CCR5 ligand ; 

(d) contacting the complex from step (b) with the agent; 
and 

(e) detecting whether any CCR5 ligand is displaced from 
the complex, wherein displacement of detectable CCR5 
ligand from the complex indicates that the agent binds 
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to the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises: 

(a) contacting one of the compounds described herein with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 
under conditions permitting binding of the CCR5 ligand 
to the compound so as to form a complex ; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is bound to 
the compound in the complex; 

(d) contacting the complex from step (a) with the agent so 
as to displace CCR5 ligand from the complex; 

(e) measuring the amount of CCR5 ligand which is bound to 
the compound in the presence of the agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in step 
(c) , wherein a reduced amount measured in step (e) 
indicates that the agent binds to the compound so as 
to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) immobilizing one of the compounds described herein on 
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a solid support ; 

(b) contacting the immobilised compound from step (a) with 
the agent and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the 
immobilized compound so as to form a complex; 

(c) removing any unbound CCR5 ligand; 

(d) measuring the amount of detectable CCR5 ligand which 
is bound to the immobilized compound in the complex; 

(e) measuring the amount of detectable CCR5 ligand which 
binds to the immobilized compound in the absence of 
the agent ,- 

(f) comparing the amount of CCR5 ligand which is bound to 
the immobilized compound in step (e) with the amount 
measured in step (d) , wherein a reduced amount 
measure d in step (d) indicates that the agent binds to 
the compound so as to thereby identify the agent as 
one which inhibits binding of the CCR5 ligand to the 
CCR5 receptor . 

In one embodiment of the above method, the amount of the 
detectable CCR5 ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

This invention also provides a method of identifying an 
agent which inhibits binding of a CCR5 ligand to a CCR5 
receptor which comprises: 

(a) contacting one of the compounds described herein with 
the agent and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the compound 



so as to form a complex; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of detectable CCR5 ligand which 
is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand which 
binds to the compound in the absence of the agent; 

(e) comparing the amount of CCR5 ligand which is bound to 
the compound in step (c) with the amount measured in 
step (d) , wherein a reduced amount measured in step 
(c) indicates that the agent binds to the compound so 
as to thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

In one embodiment of the above method, the amount of the 
detectable CCR5 ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

In one embodiment of the above method the solid support is 
a microtiter plate well. In another embodiment, the solid 
support is a bead. In a further embodiment, the solid 
support is a surface plasmon resonance sensor chip. The 
surface plasmon resonance sensor chip can have pre- 
immobilized streptavidin . In one embodiment, the surface 
plasmon resonance sensor chip is a BIAcore™ chip. 

In one embodiment of the above methods, the detectable CCR5 
ligand is labeled with a detectable marker. In another 
embodiment of the above methods, the CCR5 ligand is 
detected by contacting it with another compound which is 



WU Ul/04 /JU 

-47- - 

both capable of detecting the CCR5 ligand and is 
detectable. The detectable markers include those described 
above . 

5 This invention provides a method of identifying an agent 
which inhibits binding of a CCR5 ligand to a CCR5 receptor 
which comprises : 

a) immobilizing one of the compounds described 
herein on a solid support ; 
10 b) contacting the immobilized compound from step a) 

with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 

c) contacting the immobilized compound from step a) 
15 with the known vehicle in the absence of the 

compound and measuring the binding signal 
generated by such contact ; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c) , 

20 wherein an increased amount measured in step b) 

indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

25 In one embodiment of the above method, the solid support is 
a surface plasmon resonance sensor chip. In another 
embodiment of the above method, the binding signal is 
measured by surface plasmon resonance. 



30 This invention provides a method of obtaining a composition 
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which comprises: 

(a) identifying a compound which inhibits binding of a 
CCR5 ligand to a CCR5 receptor according to one of the 
above methods ; and 

(b) admixing the compound so identified or a homolog or 
derivative thereof with a carrier. 

The invention provides agents identified in the screen. 
Such agents may have utility in treating HIV-1 infection or 
other CCR5 -mediated diseases, which include rheumatoid 
arthritis, asthma, multiple sclerosis, psoriasis, 
atherosclerosis and other inflammatory diseases. 

In one embodiment of the above methods, the CCR5 ligand is 
a complex comprising an HIV-1 envelope glycoprotein and a 
CD4 -based protein. The HIV-1 envelope glycoproteins include 
but are not limited to gpl20, gpl40 or gpl60. The CD4 -based 
proteins include but are not limited to soluble CD4 or CD4- 
IgG2 . 

As used herein, "CD 4 " means the mature, native, membrane - 
bound CD4 protein comprising a cytoplasmic domain, a 
hydrophobic transmembrane domain, and an extracellular 
domain that binds to the HIV-1 gp!20 envelope glycoprotein. 
As used herein, "HIV-1 envelope glycoprotein" means the HIV- 
1 encoded protein which comprises the gpl20 surface 
protein, the gp41 transmembrane protein and oligomers and 
precursors thereof. As used herein, "CD4-based protein" 
means any protein comprising at least one sequence of amino 
acid residues corresponding to that portion of CD4 which is 



-49- 

reguired for CD 4 to form a complex with the HIV-1 gpl20 
envelope glycoprotein. As used herein, "CD4-IgG2" means a 
heterotetrameric CD4 -human IgG2 fusion protein encoded by 
the expression vectors deposited under ATCC Accession 
5 Numbers 75193 and 75194. 

In one embodiment of the above methods, the CCR5 ligand is 
a chemokine . The chemokines include but are not limited to 
RANTES, MlP-la or MIP-1J3. As used herein, "RANTES", "MIP-loc", 

10 and "MIP-iP" denote members of the chemokine super family of 
proteins that direct the activation and migration of 
leukocytes and other cells involved in the inflammation. 
RANTES, MlP-lof and MIP-lp are known to bind CCR5 and induce 
signaling. Their peptide sequences have been described 

15 (Wells et al . J. Leukocyte Biology, 59:53-60, 1996). 

In one embodiment of the above methods, the CCR5 ligand is 
an antibody. In one embodiment, the antibody is PA8 (ATCC 
Accession No. HB-12605). In another embodiment, the 
20 antibody is PA10 (ATCC Accession No. 12607). In another 
embodiment, the antibody is PA11 (ATCC Accession No. HB- 
12608) . In another embodiment, the antibody is PA12 (ATCC 
Accession No. HB-12609) . 

25 This invention provides a compound having the structure: 

A- (aYDINYYTSEpA) n 

wherein each T represents a threonine, each S represents a 
serine, each .E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
30 each I represents an isoleucine; and each N represents an 
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aspai'agine ; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
5 forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 

10 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein K 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a , Y, D, I , N, Y, Y , T, S , E and p are joined 

15 together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8 , A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

20 

This invention also provides a compound having the 
structure: 

OaYDINYYTSEp) n ~A 

wherein each T represents a threonine, each S represents a 
25 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
30 they are joined together by peptide bonds in consecutive 
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order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein (3 represents from 0 to 13 amino acids, with the 
5 proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 8 
10 represents an amino group or an acetyl at ed amino group; 
wherein all of oc, Y , D, I , N, Y, Y, T, S , E and 3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, wherein n is an integer from 1 to 8, 
15 A is a polymer, and the solid line represents up to 8 
linkers which attach the structure in parentheses to A. 

This invention provides a compound having the structure: 
A - (aYDINYYTSE3X) n 

20 wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 

25 with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction ; 

30 wherein (3 represents from 0 to 333 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
5 therefrom in the carboxy terminal direction; wherein A 
represents a carboxyl group or an amidated carboxyl group; 
wherein all of a , Y, D , I , N, Y, Y , T, S , E and (3 are joined 
together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
10 integer from 1 to 8 , A is a polymer, and the solid line 
represents up to 8 linkers which attach; the structure in 
parentheses to A. 

This invention also provides a compound having the 
15 structure : 

( 0aYDINYYTSE(3) n - A 

wherein each T represents a threonine, each S represents a 
serine, each E represents. a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 

20 each I represents an isoleucine; and each N represents an 
asparagine; wherein a represents from 0 to 9 amino acids, 
with the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in consecutive 
order and have a sequence identical to the sequence set 

25 forth in SEQ ID NO : 1 beginning with the I at position 9 
and extending therefrom in the amino terminal direction; 
wherein (3 represents from 0 to 333 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 

30 have a sequence identical to the sequence set forth in SEQ 
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ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; wherein 6 
represents an amino group or an acetylated amino group; 
wherein all of a, Y, D, I , N, Y, Y, T, S , E and (5 are joined 
5 together by peptide bonds, further provided that at least 
two tyrosines in the compound are sulfated, wherein n is an 
integer from 1 to 8 , A is a polymer, and the solid line 
represents up to 8 linkers which attach the structure in 
parentheses to A. 

10 

The polymer of the above compounds includes but is not 
limited to the following: a linear lysine polymer; a 
branched lysine polymers; a linear arginine polymer; a 
branched arginine polymer; and polyethylene glycol (PEG) , a 
15 linear acetylated lysine polymer, a branched acetylated 
lysine polymer, a linear chloroacetylated lysine polymer 
and a branched chloroacetylated lysine polymer. 

The above compounds can be produced by various methods 
20 known to those skilled in the art, including but not 
limited to the following. Methods for producing synthetic 
multimeric peptides such as multiple antigen peptides, 
synthetic polymeric constructs, and branched lysine 
oligopeptides are well known to those skilled in the art 
25 (Spetzler and Tarn, Int. J. Pept . Prot . Res. 45:78, 1995; 
Yai et al., J. Virol., 69:320, 1995; Okuda et al . , J. Mol . 
Recognit. 6:101, 1993). For example, radially branched 
peptides can be produced by performing standard solid-phase 
peptide synthesis methods using branched lysine skeletons 
30 on 4- (oxy-methyl) -phenylactamidomethyl or other suitable 
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solid resin. Peptide chains are elongated in parallel in a 
stepwise fashion using optimized t -butyl oxycarbonyl /benzyl 
chemistry as described (Sabatier et al . , Biochemistry 
32:2763, 1993). Peptides are liberated from the resin, 
5 purified by reversed-phase chromatography over a C18 or 
other suitable column and characterized by analytical HPLC 
and mass spectroscopy. In another approach, monomeric 
peptides are synthesized, purified, and then covalently 
coupled to lysine copolymers using N- succinimidyl maleimido 

10 carboxylate chemistry. In another approach, the peptides 
can also be made in the form of affinity type multimers. 
For example, peptides may be synthesized with an affinity 
tag such as biotin. These affinity tagged peptides can then 
be mixed with affinity ligands capable of binding 

15 multimerically , such as streptravidin . Other site- specif ic 
ligation chemistries are known to the skilled artisan. 

This invention provides a compound comprising the 
structure: 
20 6aYDnnYnnnE(3A 

wherein each E represents a glutamic acid, each D 
represents an aspartic acid, and each Y represents a 
tyrosine ; 

wherein a represents from 0 to 9 amino acids, with the 
25 proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 
30 wherein (3 represents from 0 to 13 amino acids, with the 
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proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
5 therefrom in the carboxy terminal direction; 

wherein 9 represents an amino group or an acetylated amino 
group; wherein X represents a carboxy 1 group or an amidated 
carboxyl group ,- 

wherein n represents any amino acid, 
10 wherein all of a, Y, D, n, n, Y, n, n, n, E and (3 are joined 
together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated. 

15 In one embodiment of this compound, the compound comprises 
amino acids in addition to those in the YDnnYnnnE peptide, 
and such amino acids correspond to those present in the 
CCR5 receptor sequence set forth in SEQ ID NO : 1 , yet an 
amino acid may be replaced with a homologous amino acid. 

20 The sequence YDnnYnnnE corresponds to amino acid residues 
10-18 of the sequence set forth in SEQ ID NO:l. For 
example, if the peptide has one additional amino acid on 
its N terminal end, then the sequence could be IYDnnYnnnE 
or alternatively, the I could be replaced with G, A, V or 

25 L. 

In one embodiment of the above compound, the compound is a 
peptide which comprises consecutive amino acids having the 
sequence YDnnYnnnE. 

30 
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In one embodiment of the above compound, the tyrosines at 
positions 1 and 5 of the sequence YDnriYnnnE are sulfated. 

As used herein, "homologous amino acids" are those which 
5 have chemically similar side chains. For example, aliphatic 
side chains (G, A, V, L and I) ; aromatic side chains (F, Y 
and W) ; basic aide chains (K, R and H) ; acidic side chains 
(D and E) ; amide side chains (N and Q) ; aliphatic hydroxyl- 
containing side chains (S and T) ; sulfur-containing side 

10 chains (C and M) . Homology between amino acids may also be 
drawn on other bases, such as size, polarity, hydrogen 
bonding potential, hydrophilicity and hydrophobicity . 
Proline differs from the above amino acids in that it 
contains a secondary rather than primary imino group. 

15 Accordingly, proline may be considered an imino group. 
Substitution or proline with another amino acid (e.g. G, A 
or S) can increase the flexibility of a peptide. 
Conversely, substitution of another amino acid with a 
proline can stabilize a desired conformation. 

20 

This invention provides a compound comprising the 

structure : 

9aYDINYYTSE(BX 

wherein each T represents a threonine, each S represents a 
25 serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
30 proviso that if there are more than 2 amino acids, they are 
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joined by peptide bonds in consecutive order and have a 
se q UenC e identical to the sequence set forth in SEQ ID NO: 
1 beginning with the I at position 9 and extending 
therefrom in the amino terminal dix'ection; 
5 wherein p represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined by peptide bonds in consecutive order and have a 
sequence identical to the sequence set forth in SEQ ID NO: 
1 beginning with the P at position 19 and extending 
10 therefrom in the carboxy terminal direction; 

wherein G represents an amino group or an acetylated amino 
group; wherein A represents a carboxy 1 group or an amidated 
carboxyl group ; 

wherein all of a , Y , D , I , N, Y , Y , T , S , E and (3 are joined 
15 together by peptide bonds; 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, D at 
position 2, Y at position 5 and E at position 9 may be 
20 replaced with a homologous amino acid. 

In one embodiment of the above compound, with respect to 
replacing homologous amino acids, any I amino acid residue 
may be replaced with a G,A,V or L amino acid residue. In 

25 one embodiment of the above compound, any N amino acid 
residue may be replaced with a Q amino acid residue. In one 
embodiment of the above compound, any Y amino acid residue 
may be replaced with a F or W amino acid residue. In one 
embodiment of the above compound, any T amino acid residue 

30 may be replaced with a S amino acid residue. In one 
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embodiment of the above compound, any S amino acid residue 
may be replaced with a T amino acid residue. In one 
embodiment of the above compound, any C may be replaced 
with M,S,T ; A,G,N, or Q. 

5 

In one embodiment, a C amino acid residue within the £ 
region of the compound may be replaced with any other amino 
acid . 

10 This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R298, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 
W427, thereby inhibiting binding of HIV-1 gpl20 to a CCR5 
chemokine receptor . 

15 

The above amino acid numbering is per HIV-1 strain HxB2 
(Genbank Accession No. AAB50262). Amino acids D324, 1325 
and R326 are derived from HIV-1 strain JR-FLi (Genbank 
Accession No. AAB05604) . 

20 



25 



30 



The amino acid sequence (SEQ ID NO: 17) for HIV-1 HxB2 gpl20 
is set forth below: 

1 MRVKEKYQHL WRWGWRVJGTM liLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA 
61 YDTEVHNVWA THACVPTDPN PQEWLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV 
121 KLTPLrCVSLK CTDLKNDTNT NSSSGRMIME KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD 
181 IIPIDNDTTS YKLTSCNTSV ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT 
241 NVSTVQCTHG IRPWSTQLL LNGSLAEEEV VIRSVNFTDN AKTIIVQLNT SVEINCTRPN 
3 01 NNTRKRIRIQ RGPGRAFVTI GKIGNMRQAH CNISRAKWNN TLiKQIASKLR EQFGNNKTI I 

3 61 FKQSSGGDPE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTW STEGSNNTEG SDTITLPCRI 
421 KQIINMWQKV GKAMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR 

4 81 SELYKYKWK I EPLGVAPTK AKRRWQREK R 

The amino acid sequence (SEQ ID NO: 16) for HIV-1 JR-FL 
gpl20 is set forth below: 

1 MRVKGIRKSY QYLWKGGTLIi LGILMICSAV EKLWVTVYYG VPVWKEATTT LFCASDAKAY 
61 DTEVHNVWAT HACVPTDPNP QEWLENVTE KFNMWKNNMV EQMQEDIISli WDQSLKPCVK 
121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT SIRDEVQKEY ALFYKLDWP 
181 IDNKNTSYRL ISCDTSVITQ ACPKISFEPI P I H Y CAP AG F AILKCNDKTF NGKGPCKNVS 
241 TVQCTHGIRP WSTQLLLNG SLAEEEWIR SDNFTNNAKT IIVQLKESVE INCTRPNNNT 
3 01 RKSIHIGPGR AFYTTGE 1 1 G DIRQAKCNIS RAKWNDTL.KQ IVIKLREQFE NKTIVFNHSS 

3 61 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT EGNTITLPCR IKQIINMWQE 
421 VGKAMYAPPI RGQIRCSSNI TGLLL.TRDGG INENGTEIFR PGGGDMRDNW RSELYKYKW 

4 81 KI EPLGVAPT KAKRRWQRE KR 

This invention provides the above agent, wherein the 
epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N301, T303, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427 . 

This invention provides an agent which binds to an epitope 
of HIV-1 gpl20, which epitope comprises amino acid residues 
R238, N301, T303, 1322, D324, 1325, R326, 1420, K421, Q422, 
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W427, thereby inhibiting HIV-1 infection of a CD4 + CCR5+ 
cell. 

This invention provides the above agent, wherein the 
5 epitope is altered or masked by an alanine substitution of 
at least one of the amino acid residues R298, N301, T303, 
1322, D324, 1325, R326, 1420, K421, Q422 and W427 . 

In one embodiment of any of the above agents, the agent is 
10 a peptide. In one embodiment of any of the above agents, 
the peptide comprises consecutive amino, acids having the 
sequence YDINYYTSE. In one embodiment at least two 
tyrosines in the compound are sulfated. In one embodiment, 
the tyrosines at positions 1 and 5 of the sequence 
15 YDINYYTSE are sulfated. 

In one embodiment of any of the above agents, the agent is 
an antibody or portion of an antibody . In one embodiment of 
any of the above agents, the agent is a nonpeptidyl agent. 
20 In one embodiment of ' any of the above agents, the agent is 
a peptidyl agent . 

This invention provides a method of inhibiting HIV-1 
infection of a CD4+CCR5+ cell which comprises contacting 
25 the cell with an amount of an agent of the subject 
invention effective to bind to HIV-1 gp!20, so as to 
thereby inhibit HIV-1 infection of the CD4+ CCR5+ cell. 



30 



This invention provides a compound having one of the 
following structures : 
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A- (aYDINYYTSEPX) , ( 6aYDINYYTSE(3 ) ~A, or A- (aYDINYYTSE{3) — A 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
5 each I represents an isoleucine; and each N represents an 
asparagine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
10 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
15 joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
20 carboxyl group ; 

wherein 0 represents an amino group or an acetylated amino 
group ; 

wherein all of a, Y, D, I , N, Y, Y, T, S , E and (3 are joined 
together by peptide bonds, 
25 further provided that at least two tyrosines in the 
compound are sulfated, 

wherein A is a molecule that self -oligomerizes , and the 
solid line represents a peptide linker or a peptide, 
disulfide, or other chemical bond. 



30 
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As used herein "peptide linker" is a peptide comprising 
consecutive amino acids having a sequence which includes 
but is not limited to GAG, SGGRGG and QSTRGGASGGG or 
repeating units thereof. One skilled in the art would know 
5 other flexible peptide linkers. 

In one embodiment of the above compound, the peptide that 
self -oligomerizes contains alpha-helical regions capable of 
forming coiled coils. 

10 

The a-helical coiled coil (48) is probably the most 
widespread subunit oligomerization motif found in proteins 
(48-52). It is a type of protein structure consisting of 
two to five amphipathic a-helices that "coil" around each 
15 other in a left-handed supertwist (48-52) . The sequences of 
coiled are characterized by a heptad repeat of seven 
residues with a hydrophobic repeat of mostly apolar amino 
acids . 

20 In one embodiment of the above compound, the peptide that 
self -oligomerizes is a peptide having a sequence of at 
least a portion HIV-1 gp41 heptad repeat sequence 1. In one 
embodiment, the HIV-1 gp41 heptad repeat sequence 1 is 
RQLLSG I VQQQNNLLRAI EAQQHLLQLTVWG I KQLQAR I LAVERYLKDQ (SEQ ID 

25 NO : 3) . 

In one embodiment of the above compound, the peptide that 
self -oligomerizes is a peptide having a sequence of at 
least a portion of an HIV-1 gp41 heptad repeat sequence 2. 
30 In one embodiment, the HIV-1 gp41 heptad repeat sequence 2 
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is WMEVJDRE I NNYTS LI H SL I EE SQNQQEKNEQELLE (SEQ ID NO: 4) . 

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to at least a portion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment, 
the sequence of the leucine zipper region of transcription 
factor GCN4 is HMKQLEDKVEELLSKNYHLENEVARL.KKLVGER (SEQ ID 
NO : 6 ) . 

In one embodiment of the above compound, j the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to at least a portion of the leucine zipper 
region of transcription factor GCN4 . In one embodiment, the 
sequence is derived from the leucine zipper region of 
transcription factor GCN4 . In one embodiment, the sequence 
forms trimeric coiled-coils . In one embodiment, the 
sequence is HMKQIEDKIEEILSKIYHIENEIARIKKLIGEV (SEQ ID 
NO: 7) . 



In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to at least a portion of a leucine zipper 
region of a human protein. The human protein includes but 

25 is not limited to transcription activator c-fos, 
transcription activator c-jun, enzyme quiescent cell 
proline dipeptidase, macrophage scavenger receptor, 
salivary mucin (MUC7) , or human quiescent cell proline 
dipeptidase (QPP) . In one embodiment, the human protein is 

30 qpp and the leucine zipper region has the sequence 
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LLTVEQALADFAELLRALRRDL (SEQ ID NO: 5). In one embodiment, 
the transcription activator is c-fos and the leucine zipper 
region has the sequence 

LTDTLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAR (SEQ ID NO: 8) . In 
one embodiment, the transcription activator is c-jun and 
the leucine zipper region has the sequence 
HMRRIARLEEKVKTLKAQNSELASTAJvTMLREQVAQLKQKY (SEQ ID NO: 9) . 

In one embodiment of the above compound, the peptide that 
self-oligomerizes is a peptide having a sequence 
corresponding to that of at least a portion of an antibody. 
In one embodiment, the portion of the antibody comprises 
the heavy chain. In one embodiment, the portion of the 
antibody heavy chain comprises the heavy chain constant 
region. In one embodiment, the portion of the antibody 
heavy chain comprises the hinge and Fc domains. In one 
embodiment, the portion of the antibody heavy chain 
comprises the Fc domain. In one embodiment, the portion of 
the antibody comprises the light chain. In one embodiment, 
the portion comprises the light chain constant region. In 
one embodiment, the portion of the antibody comprises the 
heavy and light chains. 

This invention provides a compound having one of the 
following structures : 

A- (aYDINYYTSEpA) , ( 6aYDINYYTSE|3 ) "A, or A- (aYDINYYTSEp ) ~A " 

wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
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asparagine ,- 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
5 have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
10 joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
1 5 carboxyl group ; 

wherein 8 represents an amino group or an acetylated amino 
group ; 

wherein all of a , Y , D , I , N, Y , Y , T, S , E and (5 are joined 
together by peptide bonds, 
20 further provided that at least two tyrosines in the 
compound are sul fated, 

wherein A is toxin, and the solid line represents a peptide 
linker or a peptide, disulfide, or other chemical bond. 

25 In one embodiment of the above compound, the toxin is a 
radionuclide. In one embodiment, the radionuclide is an 
alpha-emitting isotope. The alpha-emitting isotope includes 
but is not limited to 225 Ac, 211 At, 212 Bi, or 213 Bi . In one 
embodiment, the radionuclide is a beta-emitting isotope. 

30 The beta-emitting isotope includes but is not limited to 
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186 Rh# 188 Rh 90 Y/ 131Jf Qr 6?Cu> In en fc odiment# 

radionuclide may be emitting Auger and low energy electron. 
The radionuclide includes but is not limited to 131 I, 12S i or 
77 Br. 

5 

In one embodiment of the above compound, the toxin is a 
chemical toxin. The chemical toxin may be a peptidyl 
chemical toxin. The peptidyl chemical toxin includes but is 
not limited to ricin. The chemical toxin may be a 
10 nonpeptidyl chemical toxin. The nonpeptidyl chemical toxin 
includes but is not limited to calicheamycin . 

This invention provides a compound having one of the 
following structures : 

15 A- (aYDINYYTSEgA) , ( 0aYDINYYTSE(3 ) ~A, or A- (aYDINYYTSEp ) -A, 

wherein each T represents a threonine, each S represents a 
.serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 

20 asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 

25 ID NO: 1 beginning with the I at position 9 and extending 
therefrom in the amino terminal direction; 

wherein p represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
30 have a sequence identical to the sequence set forth in SEQ 
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ID NO: 1 beginning with the P at position 19 and extending 
therefrom in the carboxy terminal direction; 

wherein A represents a carboxyl group or an amidated 
carboxyl group ; 

5 wherein 6 represents an amino group or an acetylated amino 
group ; 

wherein all of a,Y,D,I,N,Y / Y / T,S,E and £ are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
10 compound are sulfated, 

wherein A is molecule with anti-HIV activity, and the solid 
line represents a peptide linker or a peptide, disulfide, 
or other chemical bond. 

15 In one embodiment of the above compound, the molecule with 
anti-HIV activity is a CD4- immunoglobulin fusion protein. 
In one embodiment, the CD4 - immunoglobul in fusion protein is 
CD4-IgG2, wherein the CD4-IgG2 comprises two heavy chains 
and two lights chains, wherein the heavy chains are encoded 

20 by an expression vector designated CD4 - IgG2HC-pRcCMV (ATCC 
Accession No. 75193) and the light chains are encoded by an 
expression vector designated CD4 -kLC-pRcCMV (ATCC Accession 
No. 75194) . 

25 In one embodiment of the above compound, the molecule with 
anti-HIV activity is a compound which retards gp41 from 
adopting a conformation capable of mediating fusion of HIV- 
1 to a CD4+ cell by binding noncovalently to an epitope on 
a gp41 fusion intermediate. In one embodiment, the compound 

30 comprises a peptide selected from the group consisting of 
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T-20 (SEQ ID NO: 10), DP107 (SEQ ID NO: 11), N34 (SEQ ID 
NO: 12), C28 (SEQ ID NO: 13) , N34(L6)C28 (SEQ ID NO: 14), 
and T1249 (SEQ ID NO: 15) . 

5 As used herein, "T-20" and n DP178" are used interchangeably 
to denote a peptide having the following amino acid 
sequence : YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF (SEQ ID 
NO:10) and as described [53,54]. 

10 DP107 has the following amino acid sequence: 

NNLLRA I EAQQHLLQLTVWG I KQLQAR I LAVERYLKDQ (SEQ ID NO: 11) 

N34 has the following amino acid sequence: 
SGIVQQQNNLLRAIEAQQHL.LQLTVWGI KQLQAR (SEQ ID NO: 12) 

15 

C28 has the following amino acid sequence: 
WMEWDREINNYTSLIHSLIEESQNQQEK (SEQ ID NO: 13) 

N34(L6)C28 has the following amino acid sequence: 
20 SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARSGGRGGWMEWDREINNYTSLIHSLI 
EESQNQQEK (SEQ ID NO: 14) 

T124 9 has the following amino acid sequence : 
WQEWEQKI TALLEQAQI QQEKNEYELQKLDKWASLWEWF (SEQ ID NO: 15) 

25 

This invention provides the above compound wherein the 
molecule with anti-HIV activity is a CCR5 chemokine 
receptor targeting agent. 



30 



In one embodiment, the CCR5 chemokine receptor targeting 
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agent is an antibody or portion of an antibody. In one 
embodiment, the antibody includes but is not limited to PA 8 
(ATCC Accession No. HB-12605) , PA10 (ATCC Accession 
No. 12607), PA11 (ATCC Accession No. HB-12608) , PA12 (ATCC 
5 Accession No. HB-12609), and PA14 (ATCC Accession No. HB- 
12610) . In one embodiment, the antibody is PA14 (ATCC 
Accession No. HB-12610) . 

The antibody may be a monoclonal antibody or polyclonal 
10 antibody. The monoclonal antibody may be a human, humanized 
or chimeric antibody. This invention provides humanized 
forms of the above antibodies. 

As used herein, "humanized" describes antibodies wherein 

15 some, most or all of the amino acids outside the CDR 
regions are replaced with corresponding amino acids derived 
from human immunoglobulin molecules. In one embodiment of 
the humanized forms of the antibodies, some, most or all of 
the amino acids outside the CDR regions have been replaced 

20 with amino acids from human immunoglobulin molecules but 
where some, most or all amino acids within one or more CDR 
regions are unchanged. Small additions, deletions, 
insertions, substitutions or modifications of amino acids 
are permissible as long as they would not abrogate the 

25 ability of the antibody to bind a given antigen. Suitable 
human immunoglobulin molecules would include IgGl, IgG2, 
IgG3, IgG4, IgA and IgM molecules. A "humanized" antibody 
would retain a similar antigenic specificity as the 
original antibody, i.e., in the present invention, the 

30 ability to bind CCR5 . 
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One skilled in the art would know how to make the humanized 
antibodies of the subject invention. Various publications, 
several of which are hereby incorporated by reference into 
this application, also describe how to make humanized 
5 antibodies. For example, the methods described in United 
States Patent No. 4,816,567 (55) comprise the production of 
chimeric antibodies having a variable region of one 
antibody and a constant region of another antibody . 

United States Patent No. 5,225,539 (56) describes another 
approach for the production of a humanized antibody. This 
patent describes the use of recombinant DNA technology to 
produce a humanized antibody wherein the CDRs of a variable 
region of one immunoglobulin are replaced with the CDRs 
from an immunoglobulin with a different specificity such 
that the humanized antibody would recognize the desired 
target but would not be recognized in a significant way by 
the human subject's immune system. Specifically, site 
directed mutagenesis is used to graft the CDRs onto the 
framework . 

Other approaches for humanizing an antibody are described 
in United States Patent Nos . 5,585,089 (57) and 5,693,761 
(58) and WO 90/07861 which describe methods for producing 
25 humanized immunoglobulins. These have one or more CDRs and 
possible additional amino acids from a donor immunoglobulin 
and a framework region from an accepting human 
immunoglobulin. These patents describe a method to increase 
the affinity of an antibody for the desired antigen. Some 
30 amino acids in the framework are chosen to be the same as 
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the amino acids at those positions in the donor rather than 
in the acceptor. Specifically, these patents describe the 
preparation of a humanized antibody that binds to a 
receptor by combining the CDRs of a mouse monoclonal 
5 antibody with human immunoglobulin framework and constant 
regions. Human framework regions can be chosen to maximize 
homology with the mouse sequence. A computer model can be 
used to identify amino acids in the framework region which 
are likely to interact with the CDRs or the specific 
10 antigen and then mouse amino acids can be used at these 
positions to create the humanized antibody. 

The above patents 5,585,089 and 5,693,761, and WO 90/07861 
(59) also propose four possible criteria which may used in 

15 designing the humanized antibodies. The first proposal was 
that for an acceptor, use a framework from a particular 
human immunoglobulin that is unusually homologous to the 
donor immunoglobulin to be humanized, or use a consensus 
framework from many human antibodies. The second proposal 

20 was that if an amino acid in the framework of the human 
immunoglobulin is unusual and the donor amino acid at that 
position is typical for human sequences, then the donor 
amino acid rather than the acceptor may be selected. The 
third proposal was that in the positions immediately 

25 adjacent to the 3 CDRs in the humanized immunoglobulin 
chain, the donor amino acid rather than the acceptor amino 
acid may be selected. The fourth proposal was to use the 
donor amino acid reside at the framework positions at which 
the amino acid is predicted to have a side chain atom 

30 within 3A of the CDRs in a three dimensional model of the 



antibody and is predicted to be capable of interacting with 
the CDRs. The above methods are merely illustx'ative of some 
of the methods that one skilled in the art could employ to 
make humanized antibodies. 

This invention provides the above compound, wherein the 
portion of the antibody is a Fab fragment of the antibody. 
This invention provides the above compound, wherein the 
portion of the antibody comprises the variable domain of 
the antibody. This invention provides the above compound, 
wherein the portion of the antibody comprises a 
complementarity determining region or CDR portion of the 
antibody. The monoclonal antibody includes but is not 
limited to an IgG, IgM, IgD, IgA, or IgE monoclonal 
antibody . 

This invention provides the above compound, wherein the 
molecule with anti-HIV activity is a chemokine or chemokine 
derivative. The chemokine includes but is not limited to 
RANTES, MlP-la, MIP-13/ SDF-1 or other chemokine which 
blocks HIV-1 infection. The chemokine derivative includes 
but is not limited to Met - RANTES , AOP- RANTES , RANTES 9-68, 
or NNY- RANTES. 

The molecule may also be a non-chemokine agent capable of 
binding to chemokine receptors and inhibiting fusion of 
HIV-1 to CD4 + cells. The non-chemokine agents include, but 
are not limited to, chemokine fragments and chemokine 
derivatives and analogues. In one embodiment, the agent 
does not include naturally occurring chemokines . The non- 
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chemokine agents include multimeric forms of the chemokine 
fragments and chemokine derivatives and analogues or fusion 
molecules which contain chemokine fragments, derivatives 
and analogues linked to other molecules. In one 

5 embodiment, the non-chemokine agents do not include 
bicyclams and their derivatives as described in U.S. Patent 
No. 5,021,409, issued June 4, 1991, the content of which is 
incorporated by reference into this application. Some 
bicyclam derivatives have been previously described with 
10 antiviral activities (60, 61). 

In an embodiment of this invention, the non-chemokine agent 
is an oligopeptide. In another embodiment, the non- 

chemokine agent is a polypeptide. In still another 

15 embodiment, the non-chemokine agent is an antibody or a 
portion thereof. Antibodies against the chemokine receptor 
may easily be generated by routine experiments. It is also 
within the level of ordinary skill to synthesize fragments 
of the antibody capable of binding to the chemokine 

20 receptor. In a further embodiment, the non-chemokine agent 
is a nonpeptidyl agent such as TAK-779 (64) or AMD3100 
(65) . 

Non-chemokine agents which are purely peptidyl in 
25 composition can be either chemically synthesized by solid- 
phase methods (62) or produced using recombinant technology 
in either prokaryotic or eukaryotic systems. The synthetic 
and recombinant methods are well known in the art. 



30 Non-chemokine agents which contain biotin or other 
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nonpeptidyl groups can be prepared by chemical modification 
of synthetic or recombinant chemokines or non-chemokine 
agents. One chemical modification method involves 

periodate oxidation of the 2-amino alcohol present on 
5 chemokines or non-chemokine agents possessing serine or 
threonine as their N-terminal amino acid (63). The 
resulting aldehyde group can be used to link peptidyl or 
non-peptidyl groups to the oxidized chemokine or non- 
chemokine agent by reductive amination, hydrazine, or other 
10 chemistries well known to those skilled in the art. 

This invention provides a compound having one of the 
following structures : 

A- (aYDINYYTSEPA) , (0aYDINYYT£E3 ) ~A, or A- (aYDINYYTSE^ ) ~A 

15 wherein each T represents a threonine, each S represents a 
serine, each E represents a glutamic acid, each Y 
represents a tyrosine; each D represents an aspartic acid, 
each I represents an isoleucine; and each N represents an 
asparagine ; 

20 wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 
ID NO: 1 beginning with the I at position 9 and extending 

25 therefrom in the amino terminal direction; 

wherein 3 represents from 0 to 13 amino acids, with the 
proviso that if there are more than 2 amino acids, they are 
joined together by peptide bonds in consecutive order and 
have a sequence identical to the sequence set forth in SEQ 

30 ID NO: 1 beginning with the P at position 19 and extending 
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therefrom in the carboxy terminal direction ; 

wherein A represents a cax'boxyl group or an amidated 
carboxyl group ; 

wherein 0 represents an amino group or an acetylated amino 
5 group; 

wherein all of a, Y, D, I,N,Y,Y,T,S,E and P are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 
10 wherein A is peptidyl or nonpeptidyl agent, and the solid 
line represents a peptide linker, or a peptide, disulfide, 
or other chemical bond. 

In one embodiment, the A is a nonpeptidyl agent, and the 
15 nonpetidyl agent polyethylene glycol. 

This invention will be better understood from the 
Experimental Details that follow. However, one skilled in 
the art will readily appreciate that the specific methods 
20 and results discussed are merely illustrative of the 
invention as described more fully in the claims that follow 
thereafter . 
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EXPERIMENTAL DETAILS 

First Series of Experiments 
A . Materials 

5 Purified recombinant CD4-IgG2 protein was produced by 
Progenies Pharmaceuticals, Inc. from plasmids CD4-IgG2-HC- 
pRcCMV and CD4 -kLC-pRcCMV as described (Allaway et al . AIDS 
Res. Hum. Retroviruses 11:533, 1995). Soluble CD4 is 
commercially available (NEN Life Science Products, Boston, 
10 MA) . Anti-CCR5 MAb 2D7 was purchased from Pharmingen (San- 
Diego, CA) . 

The plasmids designated PPI4 - tPA-gpl2 0 JR _ FI> -V3 (_) and PPI4-tPA- 
gpl2 0 DH123 were prepared as described (Hasel et al , US 

15 Patents 5,869,624 and 5,886,163). Monomeric gpl20 

glycoproteins were produced in CHO cells stably transfected 
with the PPI4 - tPA-gpl20 plasmids and purified to 
homogeneity as described (Hasel et al . US Patents 5,869,624 
and 5,886,163; Trkola et al . Nature 384:184, 1996). The 

20 antibodies designated PA8 , PA10, PA12 and PA14 were 
prepared by growing the corresponding hybridoma cell line 
in mouse ascites and isolating the antibody using protein A 
affinity chromatography as described (Olson et al . J.Virol. 
73:4145, 1999). LI . 2 - CCR5 + eel 1 s were cultured as described 

25 (Olson et al . J.Virol. 73:4145, 1999). 

Peptides containing different segments of the CCR5 Nt were 
custom- synthesized by solid-phase f luorenylmethoxycarbonyl 
chemistry using phospho- and sulfo- tyrosine precursors as 
30 building blocks where indicated (Figure 6) . Biotinylated 
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versions of peptides S-10/14 and P-10/14 incorporated a C- 
terminal GAG spacer preceding a biotinylated lysine. 
Following cleavage from the resin, peptides were purified 
by reverse -phase chromatography on CI 8 columns (Vydac, 
5 Hesperia, CA) and analyzed by HPLC and mass spectroscopy. 
Figure 6 describes the different peptides that were used in 
this study. 

Binding of opl20 to CCR5 

10 A gp!20/CD4 complex formed from monomer ic gpl20 (lOOnM) and 
biotinylated CD4-lgG2 (50nM) was added to lxlO 6 L1.2-CCR5* 
cells in the presence of different concentrations of 
peptide (Olson et al . J.Virol. 73:4145, 1999). CD4-IgG2 is 
tetrameric and therefore binds four molecules of gp!20, 

15 which increases binding of the complex to CCR5 (Allaway et 
al . AIDS Res. Hum. Retroviruses 11:533, 1995). The mean 
fluorescence intensity (m.f.i.) was measured by flow 
cytometry after addition of phycoerythrin (PE) -labeled 
streptavidin (Becton Dickinson, San Jose, CA) . Inhibition of 

20 gpl20/CCR5 binding was calculated: (m.f.i. with 
peptide) / (m. f . i . without peptide) x!00%. 

It was first tested whether tyrosine-sulfated peptides 
spanning amino acids 2-18 of the CCR5 Nt could inhibit 

25 binding of the gp!2 0^.^/004 - IgG2 complex to CCR5 + cells. The 
HIV-1 JR . FL isolate exclusively uses CCR5 as a co-receptor 
(Dragic et al . Nature 381:667, 1996). Only peptides S- 
3/10/14 and S-10/14 inhibited complex binding to the cells 
in a dose -dependent manner (Fig. la). Peptides S-10 and S- 

30 14 had no inhibitory activity, even at the highest 
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concentrations (Fig. la). Peptide TS-10/14, spanning amino 
acids 10-14 , did not inhibit gpl2 0jr.pl/CD4 - IgG2 binding to 
CCR5 + cells, despite the presence of two sulfo- tyrosine 
residues (Fig. lb) . 

5 

Tyrosine -phosphorylated peptides P-10/14 and P-3/10/14 did 
not inhibit gp!2 0 JR _ FL /CD4 -IgG2 binding to CCR5 + cells (Fig. 
lb) . As further specificity controls we synthesized 
peptides containing the first seventeen residues of the 
10 CCR5 Nt in random order with sulfo- tyrosines in positions 
10 and 14 (SS-10/14) or in positions 2 and 12 (SS-2/12) . 
Neither one of these peptides reduced gpl2 0jr.pl/CD4 - IgG2 
binding to CCR5 4 cells, even at the highest concentrations 
(Fig. lb) . 

15 

Surface plasmon resonance measurements (BIAcore) 
Streptavidin- coated sensor chips (BIAcore AB, Sweden) were 
conditioned with five injections of regeneration solution 
(1M NaCl, 50mM NaOH) and equilibrated with HBS-EP buffer 

20 (lOmM HEPES, 150mM NaCl, 3M EDTA, 0.005% polysorbate 20) as 
recommended by the manufacturer. Biotinylated peptides were 
then immobilized on the chip by injection of peptide 
(lOOnM) in HBS-EP buffer, followed by an injection of 
regeneration solution and equilibration with HBS-EP buffer. 

25 4 00 resonance units (RU) of peptide were bound to the 
sensor chip surface. Solutions of the following proteins 
(lOOnM) were passed over the sensor chip surface: gpl20, 
sCD4 , gpl20/sCD4, PA8 , PA10 and 2D7 . Surface plasmon 
resonance was monitored and displayed in arbitrary 

30 resonance units (RU) as a function of time. Following 
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injection of each solution the chip was regenerated and 
equilibrated as described above. 



Biotinylated peptide was attached to the streptavidin- 
5 coated gold surface of a sensor chip and solutions 
containing different gpl20/sCD4 complexes were flowed over 
the immobilized peptide. Adsorption of the complex due to 
complex/peptide binding was detected by an increase in 
surface plasmon resonance signal (RU) , which reports 

10 changes in the effective refraction index very near the 
gold surface of the sensor chip (Schuck Ann. Rev. Biophys 
Biomol Struct 26:541, 1997). For proteins of similar size, 
such as the different gp!20/sCD4 complexes, RU plateau 
values are directly proportional to the amount of protein 

15 bound to the peptide. 



Specific association of the gpl2 0 JR _ FL /sCD4 complex with the 
sulf o- tyrosine -containing peptide bS-10/14 was accompanied 
by a significant increase in RU (Fig. 2a). The signal 

20 plateau but not the shape of the sensograms varied with 
gpl 2 0jr.pl/sCD4 concentration indicating that the 

pept ide/complex interaction was dose-dependent (data not 
shown) . The sensorgram obtained with bP-10/14 is similar 
to the one obtained in the absence of peptide, indicating a 

25 complete lack of association of the phosphorylated peptide 
with the protein complex (Fig. 2a). Neither gpl20 JK . FL nor 
sCD4 alone produced a significant increase in RU, 
indicating that they did not associate with the immobilized 
peptides. (Fig. 2b, c). The gpl2 0 -AV3jr.pl/sCD4 complex was 

30 also unable to associate with the peptides (Fig. 2d) . 
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To further ascertain the specificity of the peptide/complex 
association we performed BIAcore analyses using envelope 
glycoproteins from HIV-1 DH123 , an R5X4 isolate, and HIV-1^, 
an X4 isolate (5) . gpl20 DH123 /sCD4 associated specifically 
5 with the sulfated peptide, although the plateau RU values 
were lower than those observed with gpl 2 0jr.pl/sCD4 (Fig. 
2e) . We did not detect any binding of gpl20 DH123 /sCD4 to the 
phosphorylated peptide (Fig.2e), nor did gpl20 DH123 alone 
associate with the peptides (Fig. 2f ) . Finally, gpl20 LAI with 
10 or without sCD4 was not able to associate with either one 
of the peptides (Fig. 2g,h) . 

These methods could be readily modified to screen for 
agents that bind CCR5 or that block its interaction with 

15 antibodies, gpl20 or other ligands. For example, direct 
binding of the agents could be analyzed as described above, 
where the agent is substituted for the anti-CCR5 antibody 
or gpl2 0/sCD4 complex. In another embodiment, the agent 
could be mixed or pre- incubated with the anti-CCR5 antibody 

20 (or gp!20/sCD4 complex) prior to passing the mixture over 
biosensor chips as described above. 

Binding of MAbs to CCR5 

L1.2-CCR5 cells (IxlO 6 ) were incubated with anti-CCR5 MAb 
25 (50nM) + peptide (100/iM) . MAb binding was detected using a 
PE-labeled goat anti -mouse antibody (Caltag Laboratories, 
Burlingam, CA) . The m.f.i value was measured by flow 
cytometry as described (Olson et al . J. Virol. 73:4145, 
1999) . MAb binding was calculated as above. 



30 
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We determined whether the CCR5 Nt peptides could inhibit 
binding of a panel of anti-CCR5 MAbs to CCR5 + cells. PA 8 
binding was reduced significantly by all wild-type peptides 
containing amino acids 2-18, regardless of tyrosine 
5 modification (Fig. 3). BIAcore analysis confirmed that PA8 
similax'ly and specifically associated with both sulfated 
and phosphorylated peptides (Fig. 4). Binding of PA12 to 
CCR5 was not inhibited by any of the peptides (Fig. 3) . 
PA10 binding to CCR5 was inhibited only by S-3/10/14 (Fig. 

10 3) . PA10 was also observed to associate with bS-10/14 and 
to a lesser extent with bP-10/14 in BIAcore analysis (Fig. 
4) , which may be more sensitive than the gpl20/CCR5-binding 
assay. Binding of 2D7 to CCR5 was not inhibited by any of 
the peptides (Fig. 3). No significant interaction was 

15 observed between any CCR5 Nt peptide and Mab 2D7 (Figs. 3 
and 4), whose epitope resides within the second 
extracellular loop on CCR5 . 

Single cycle HIV-1 entry assay 

20 Nlluc'env particles pseudotyped with envelope glycoproteins 
from MuLV, HTLV-1 and HIV-1 strains JR-FL, HxB 2 , DH123 , Gun- 
1 were made as descx-ibed (Dragic et al . J. Virol. 72:279, 
1998) . Target cells (Hela -CD4 + CCR5 + or U87-CD4 + CCR5 + ) were 
incubated with virus -containing supernatant fractions 

25 (lOOng/ml p24) ± peptide (100/zM) for 4 h. then washed and 
resuspended in culture media. After 4 8 hours the cells were 
lysed and luciferase activity (relative light units, 
r.l.u.) was measured using a standard kit (Promega, 
Madison, WI) as described (Dragic et al . J. Virol. 72:279, 

30 1998). Viral entry was calculated: (r.l.u. with 
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peptide) / (r . 1 . u without peptide) xlOO%. 

The ability of different CCR5 Nt peptides to inhibit HIV-1 
entry into CD4 + CCR5 + CXCR4 + cells was tested using a 
5 lucif erase-based single round of entry assay (5) . Only 
peptides S-10/14 and S-3/10/14 inhibited the entry of the 
R5 isolate HIV-1 JR . FL by approximately 50% in HeLa-CD4 + CCR5 + 
and U87MG-CD4 + CCR5 + (Fig. 5 and data not shown) . We were 
unable to inhibit the entry of the R5X4 isolates HIV-1 DH123 
10 and HIV-1 

Gun-i / °f the X4 isolate . The entry of 

MuLV and HTLV pseudotypes was also unaffected by the 
peptides (Fig. 5). 

15 Screening assays 

1) HIV-1 gpl20/CD4-IgG2 

Streptavidin-coated 96-well microtiter plates (NEN Life 
Science Products, Boston, MA) are blocked with 200 /il/well 
of 5% bovine serum albumin (Sigma, St. Louis, MO) in PBS 

20 buffer and washed with assay buffer (0.5% Tween 20, 1 % 
fetal bovine serum, and 2% BSA in PBS buffer) . The plates 
are then incubated 1 hour at ambient temperature with 100 
^I/well of biotinylated CCR5 N- terminal sulfopeptide at a 
concentration of 500 fiM in assay buffer. Following a wash 

25 step, the plates are incubated for 1 hour at ambient 
temperature with an HIV-1 JR . FL gpl20/CD4 - IgG2 complex in the 
presence or absence of inhibitory agent . The plates are 
again washed and incubated for 30 minutes with a 
horseradish peroxidase-labeled goat antibody to human IgG 

30 (Kirkegaard & Perry Laboratories, Gaithersburg , MD) 
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followed by addition of the TMB (3,3' ,5,5'- 
tetramethylbensidine) chromogenic substrate (Pierce) . The 
reaction is stopped by addition of 100 /xl/well of 2N H 2 S0 4 
prior to colorimetric detection at a wavelength of 450 nm. 
Wells without biotinylated peptide serve as negative 
controls. The percent inhibition of binding is calculated 

aS " (° D with inhibitor ~ 0D ccntrol well)/ (OD without - inhib i tor - OD control 

wen)] x 100, where OD represents the average optical density 
observed for the indicated wells. 

2 ) Ant i -CCR5 antibodies 

Streptavidin-coated microtiter plates are blocked and 
incubated with CCR5 N- terminal peptide as described above. 
Following a wash step, the plates are incubated for one 
hour at ambient temperature with the anti-CCR5 antibody 
PA10 in the presence or absence of inhibitory agent. The 
plates are again washed and incubated for 30 minutes with a 
horseradish peroxidase- labeled goat antibody to mouse IgG 
(Kirkegaard & Perry Laboratories, Gaithersburg , MD) 
followed by addition of TMB substrate for colorimetric 
detection as described above. The percent inhibition 
mediated by the inhibitory agent is calculated as described 
above . 

Discussion 

Tyrosine-modified peptides spanning the region of the CCR5 
Nt that contains residues important for viral entry were 
synthesized. (Dragic et al . J. Virol. 72:279, 1998; Rabut 
et al . J. Virol. 72:3464, 1998; Farzan et al . J. Virol. 
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72:1160, 1998; Dorantz et al . J. Virol. 71:6305, 1997). 
Interactions between the Nt peptides and gp!20/CD4 
complexes were characterized. Peptides containing sulfo- 
tyrosines in positions 10 and 14 efficiently inhibited 
5 binding of gpl2 0 JR _ FL /CD4 to CCR5 . Substitution of the 
sulfate groups for phosphates, which are also negatively 
charged at physiological pH, rendered the Nt peptides 
inactive. Inhibition of gpl20/CCR5 binding was dependent, 
t h ere fore, on the presence of sulfate moieties and was not 

10 simply due to non-specific electrostatic interactions 
between the peptide and the gpl20/CD4 complex or the 
peptide and the cell surface. Inhibition of gpl20/CCR5 
binding was also dependent on the primary structure 
surrounding the sulfo- tyrosines since peptides with random 

15 sequences of CCR5 amino acids 2-18 had no inhibitory 
activity. Additional Nt amino acids in the region 2-18 were 
important for activity since a shortened peptide containing 
just amino acids 10-14 was unable to inhibit gpl20/CD4 
binding, despite the presence of two sulfo- tyrosines . It 

20 would be straightforward to define the minimum number of 
amino acids needed for activity by systematically 
synthesizing sulf opept ides intermediate in length between 
peptide 2-18 and peptide 10-14. Similarly, sulf opept ides 
that incorporate a greater portion of the CCR5 Nt could be 

25 easily synthesized and tested for activity using the 
methods described herein. 

Qualitative BIAcore analyses allowed the demonstration of a 
highly specific, CD4-dependent interaction between a 
30 tyrosine-sulfated Nt peptide and gpl2C JR . Flj . No binding of 
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the protein complex to a tyrosine -phosphorylated peptide 
was observed. Only gpl20s derived from isolates that use 
CCR5 as a co-receptor associated with the sulfated peptide. 
gpl2 0 DH123 /CD4 binding was weaker than gpl 2 0jr.pl/CD4 binding, 
5 suggesting that envelope glycoproteins from R5X4 isolates 
have a lower apparent affinity for CCR5 than envelope 
glycoproteins from R5 isolates. gpl20 1AI/ derived from an 
isolate that only uses CXCR4 , did not bind to the sulfated 
peptide. A V3 loop-deleted gpl20 JR _ FL did not associate with 
10 the sulfated peptide, just as this protein was unable to 
bind to full length CCR5 on the cell surface (Trkola et al . 
Nature 384:184, 1996). 

The binding of the Nt peptides to several anti-CCR5 MAbs, 

15 all of which recognize conformational epitopes in CCR5 and 
inhibit gpl20/CCR5 binding were also studied. PA12 and 2D7 
did not bind to any of the peptides. Binding of PA 8 to the 
peptides was independent of tyrosine-modif ication whereas 
PA10 associated more with the sulf o- tyrosine-containing 

20 peptide than with the phospho-tyrosine-containing peptide. 
It seems, therefore, that sulf o- tyrosines and phospho- 
tyrosines are relatively interchangeable for the purpose of 
MAb binding but that gpl20/CD4 binding has an absolute 
requirement for sulf o- tyrosines . Relatively subtle 

25 differences in size and geometry of sulfate and phosphate 
groups might be relevant for binding of the CCR5 Nt with 
gpl2 0, which must not only accept the negative charge, but 
also coordinate, probably by hydrogen bonds, the tyrosine 
sulfate oxygens. The kinetics of MAb binding to the CCR5 Nt 

30 peptides exhibited large apparent on rates and slow 
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apparent off rates, which also differed from our 
observations of gpl20/CD4 binding kinetics. 

None of the Nt peptides inhibited MuLV, HTLV and HIV-l,^ 
envelope-mediated viral entry, which is not mediated by 
CCR5 . In contrast, peptides S-10/14 and S-3/10/14 
specifically inhibited the entry of the HIV-l^.p^ R5 strain 
in two different cell lines. The inhibition of HIV-1 entry 
by tyrosine-sulfated peptides was partial (-50%) but 
nonetheless striking given the difficulty of blocking this 
process with short, linear peptides (Jameson et al . Science 
240:1335, 1988; Chan and Kim Cell 93:681:1998; Doranz et 
al . J. Exp. Med. 186:1395, 1997; Heveker et al . Current 
Biology 8:369, 1998; Eckert et al . Cell 99:1, 1999). 
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Second Series of Exper iments 

CD4 and CCR5 mediate fusion and entry of R5 HIV-1 strains. 
Sulfotyrosine and other negatively charged residues in the 
CCR5 amino- terminal domain (Nt) are crucial for gpl2 0 
5 binding and viral entry. It is shown that a soluble 
g P 120/CD4 complex specifically binds to a peptide 
corresponding to CCR5 Nt residues 2-18, with sulf otyrosines 
in positions 10 and 14. This sulfopeptide also inhibits 
soluble gpl20/CD4 binding to cell surface CCR5 as well as 
10 infection by R5 virus. These observations prompted the 
further delineation of the determinants of the gpl20-CCR5 
Nt sulfopeptide interaction. It is shown that residues 10- 
18 constitute the minimal domain of the CCR5 Nt that is 
able to specifically interact with soluble gl20/CD4 
15 complexes. In addition to sulf otyrosines in positions 10 
and 14, negatively charged residues in positions 11 and 18 
participate in this interaction. Furthermore, the CCR5 Nt 
binds to a CD4-induced surface on gpl20 that is composed of 
conserved residues in the V3 loop stem and the C4 domain. 
20 Binding of gpl2 0 to cell surface CCR5, however, is further 
influenced by variable residues in the crown of the V3 
loop. This data suggest that gpl20 docking to CCR5 is an 
interdependent, multi-step process involving different 
regions of the envelope glycoprotein and the co-receptor. 

Entry of HIV-1 R5 isolates into target cells is mediated by 
the successive interaction of the envelope glycoprotein 
gpl20 with CD4 and the CCR5 co-receptor [3] . Gpl20-CD4 
complex formation generates a large bonding energy that 
30 drives reordering of the gpl20 core structure [22, 31, 47]. 
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Changes in the orientation of the V1/V2 and V3 loops, as 
well as the bridging sheet (composed of the V1/V2 stem and 
C4) , cooperatively create and/or expose a co-receptor 
binding site on gpl20 [22,37,47]. The predicted co-receptor 
binding surface on gpl20 has a hydrophobic core surrounded 
by a positively charged periphery and is composed of both 
conserved and variable residues located in the C4 domain 
and V3 loop, with lesser contributions from the V1V2 stem 
[22, 36, 37] . 



It has been demonstrated that specific amino acids within 
the CCR5 amino- terminal domain (Nt , amino acids 2-31) , 
including negatively charged and tyrosine residues, are 
essential for CCR5-mediated fusion and entry of R5 and R5X4 
15 HIV-1 strains [5, 12, 13, 15, 35]. Farzan et al . [16] 
demonstrated that the CCR5 Nt undergoes both O- 
glycosylation and tyrosine sulfation. It is presently not 
known whether O-glycosylation plays a role in co-receptor 
function, but this possibility is suggested by observations 
20 that serines in the Nt are important for viral entry. 
Inhibition of cellular sulfation pathways, including 
tyrosine sulfation, greatly decreases gpl20 binding to CCR5 
as well as the entry of R5 and R5X4 HIV-1 strains into 
target cells ([16], E.G.C. unpublished data). Post- 
25 translational sulfation of the tyrosine residues in the 
CCR5 Nt, therefore, may critically modulate the 
susceptibility of target cells to HIV-1 infection in vivo. 

It was demonstrated that a CCR5 Nt -based peptide spanning 
30 residues 2-18 and containing sulf otyrosines in positions 10 
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and 14 specifically associates with soluble gpl20/CD4 
complexes containing envelope glycoproteins from R5 (JR-FL) 
and R5X4 (DH123) but not X4 (LAI) strains [11] . Peptides 
containing unmodified tyrosines or phosphotyrosines , 
5 however, did not bind soluble gpl20/CD4 complexes [11] . The 
tyrosine-sulfated CCR5 Nt therefore specifically interacts 
only with gpl20 proteins from isolates that use this co- 
receptor to gain entry into target cells. Furthermore, only 
the CCR5 Nt-based sulfopeptide inhibits binding of soluble 

10 gpl2 0jr.pl/CD4 to intact, cell surface-expressed CCR5 and 
moderately blocks the entry of the R5 isolate JR-FL. The 
affinity of soluble gpl20/CD4 for the CCR5 Nt sulfopeptide, 
however, is approximately 10-100-fold lower than for the 
native, membrane-associated co-receptor [11, 42, 46], 

15 suggesting that other gpl20-CCR5 contacts are required to 
consolidate this interaction. This concept is further 
supported by studies of CCR5 chimera, as well as studies 
with inhibitors of CCR5 co-receptor function [12, 34, 38, 
32, 14] . 

20 

A novel ELISA is reported to detect binding of 
sulf opeptides to soluble gpl20/CD4 complexes, as well as 
anti-CCR5 MAbs and chemokines. ELISA and surface plasmon 
resonance (SPR) were used to further delineate the 

25 determinants of the gp!20-CCR5 Nt interaction. In order to 
define the minimal domain of the CCR5 Nt capable of 
specifically binding to soluble gp!20/CD4 complexes, 
sulf opeptides corresponding to different regions of the Nt 
were analyzed. To identify the gpl20 domains involved in 

30 sulfopeptide binding, inhibition of gpl20/CD4 complex 
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binding to CCR5 Nt sulf opeptides by anti-gpl20 Mabs was 
studied. Residues in or near the epitopes of inhibitory 
MAbs were mutated to alanine, and the gpl20 point mutants 
were compared for their ability to bind to CCR5 Nt 
sulfopeptides and cell-surface CCR5 . The data suggest that 
a mostly conserved surface of gpl20 binds to a nine-residue 
stretch of the CCR5 Nt , whereas more variable residues in 
the crown of the V3 loop may interact with a secondary 
binding site on CCR5 . 

Materials and Methods 

Reagents : CD4-IgG 2 , soluble CD4 (sCD4) , recombinant soluble 
gpl20s from HIV-1^ (X4), HIV-1 DH123 (R5X4), and HIV-l JR . PIj (R5) 
isolates, anti-gpl20 MAb PA1 (directed against the V3 loop 
of JR-FL) and anti-CCR5 MAbs PAS , PA10, PA11, PA12 , PA14 
were produced by Progenies Pharmaceuticals, Inc. 
(Tarrytown, NY) as described [1, 32]. MAbs 133-290, 133- 
192, 135-9, A32, 17b, 19b, 48d, 9284, G3-42, Cll, G45-60 
and 2G12 were a generous gift [26] . The small -molecule CCR5 
antagonist TAK-779 was obtained as described [14] . 



Peptides corresponding to different segments of the CCR5 Nt 
were synthesized as described previously (Table 1) [11] . 
25 Sulfo- or phospho-tyrosines were incorporated in positions 
10 and 14, and all peptides carried a carboxy- terminal Gly- 
Ala-Gly spacer preceding a biotinylated lysine. Residues 
were numbered according to their positions in the full 
length CCR5 protein. 
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Surf ace plasmon resonance : Einding of gpl20/CD4 -IgG 2 complex 
and MAbs to CCR5 Nt-based peptides was measured as 
previously described [11] . Briefly, streptavidin-coated 
sensor chips were divided into two surfaces, each with a 
5 separate flow chamber. The sensor chip was conditioned and 
equilibrated as recommended by the manufacturer. 
Biotinylated peptide (400 resonance units, RU) was bound to 
the surface of the second chamber whereas the first chamber 
of the chip was used as a negative control. Gpl2 0/CD4 - lgG 2 

10 complex (50 nM) was passed over the chip surface in the 
presence or absence of MAbs (150 mM) .■ Surface plasmon 
resonance was monitored and displayed in RU as a function 
of time using a Biacore X. After each measurement the chip 
was regenerated and equilibrated as recommended by the 

15 manufacturer. 

Generation of ap!20 alanine mutants and their binding to 
CD4 -IgG o : Mutant gpl20 proteins were generated using the 
QuickChange Kit from Stratagene (San-Diego, CA) . Gpl2 Oj^.n,, 

20 cloned into the pPPI4 expression vector [4] , served as the 
template for site directed mutagenesis. Nucleotide 
sequencing was performed to ascertain the presence of the 
appropriate mutation in the gpl20 coding sequence. 293T 
cells were calcium phosphate transfected with the different 

25 mutant gpl20 expression constructs. Supernatants containing 
soluble gp!20 proteins were harvested and cleared of debris 
by centrif ugation 24 hours post - transf ection . 

Quantification of gpl20 was performed by ELISA as 
previously described [40] ) . Briefly, 293T supernatants were 

30 boiled for 5 minutes and denatured gpl20 was captured on an 
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ELISA plate coated with D7324 (International Enzymes Inc. 
Fallbrook, CA) , a MAb that recognizes a 15 residue linear 
epitope in the carboxy- terminal end of gpl20. Captured 
gpl20 was detected by a mixture of anti-gpl20 MAbs B12 and 
B13 [40] , followed by incubation with a horseradish 
peroxidase-conjugated (HRP) anti-mouse IgG antibody 
(Amersham Pharmacia, Piscataway, N.J.)- Optical density 
(O.DJ was measured at 450 nm using the ImmunoPure TMB 
Substrate kit (Pierce, Rockford, IL) . 



CD4-IgG 2 binding to non-denatured mutant gpl20 proteins also 
was measured. Plates coated with D7324 were used to capture 
native gpl2 0 from supernatants of transiently transfected 
293T cells. CD4-IgG 2 (50 nM) was added to the plates and its 
15 binding was detected using an HRP- conj ugated goat anti- 
human IgG and TMB substrate as described above. 

CCR5 Nt peptide ELISAs : Streptavidin- coated ELISA plates 
(Pierce, Rockford, IL) were blocked with D-PBS/ 5% BSA for 

20 2 hours at room temperature then washed three times with 
assay buffer (D-PBS/ 0.5% Tween 20/ 1% Fetal Bovine Serum/ 
2% BSA) . Plates were then contacted with sulfo- or phospho- 
peptides (1 /ig/ml) for 1 hour at room temperature and 
washed three times with assay buffer. Mixtures of CD4-IgG 2 

25 (50 nM) and purified gp!20 or gp!20 -containing supernatants 
in a 1:4 molar ratio were added to the plates for 1 hour at 
room temperature. Plates were washed three times and (HRP) - 
conjugated goat anti-human IgG was used to detect the 
presence of bound CD4-IgG 2 . The plates were developed using 

30 the TMB substrate as described above. Gpl20/CD4-IgG 2 binding 
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to the peptides was normalized for CD4-IgG 2 binding to the 
mutant gpl20 proteins. 

In a competition ELISA, peptides were captured onto the 
5 plates as described above. Inhibitor or assay buffer was 
added for 1 h prior to addition of gpl2 0 JR _ FL /CD4 - IgG 2 complex 
(1 nM) for an additional h at room temperature. The assay 
was then completed as described above. Direct binding of 
ant i -CCR5 murine MAbs to the peptides was examined as 
10 described above except that MAb was substituted for 
gpl20/CD4 - IgG 2 complex and a goat anti-mouse HRP-coupled 
antibody was used for detection. 

Binding of op 12 0/ CD4 - I oG ? complexes to cell -surface CCR5 : 
L1.2-CCR5 4 cells (10 6 ) were incubated with gpl20 -containing 
supernatant (lOOnM) and biotinylated CD4-IgG 2 (50nM) fox- 1 
hour at 37°C in assay buffer, as previously described [32] . 
Gpl20/CD4 -IgG 2 bound to the cells was revealed by FACS 
analysis of the mean fluorescence of intensity (m.f . i.) 
after addition of streptavidin-PE (Pharmingen, San-Diego, 
CA.). Binding was calculated using the formula: (m.f.i. 
gpl20 mutants) / (m. f . i . gpl20 wild type) x 100% and 
normalized for CD4-IgG 2 binding to the mutant gpl2 0 
proteins . 

Results 

An ELISA to detect binding of soluble gpl20/CD4 complexes 
to CCR5 Nt -based peptides : Surface plasmon resonance (SPR) 
was previously used to show that gp!20/sCD4 complexes 
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specifically interact with a peptide spanning CCR5 residues 
2-18 and containing sulf otyrosines in positions 10 and 14 
(2-18, Table 1). The on and off rates of complex-peptide 
binding were extremely rapid and could not be measured 

5 precisely by SPR. The Kd was estimated to be in the 10~ 7 -10" 8 
range. Replacing monomeric sCD4 with tetravalent CD4-IgG 2 , 
however, lead to a dramatic shift in both on and off rates, 
lowering the Kd into the 10- 8 -l(r 9 range (Figure 7a) . This 
observation prompted us to develop an ELISA to directly 

10 detect complex-peptide binding. Streptavidin-coated ELISA 
plates were used to capture biot inylated, CCR5 Nt-based 
peptides, and then further incubated with soluble 
gpl2 0/CD4-IgG 2 complexes. Complex binding was detected using 
an HRP- conjugated goat anti-human IgG antibody. 

15 

Sulfopeptide 2-18 bound gpl2 0jr.pl/CD4 - IgG 2 with an IC 50 ~lnM, 
and gpl2 0 DH123 /CD4-IgG 2 with an IC 50 ~5nM (Figure 7b) . 
Sulfopeptide 2-18 did not measurably bind CD4-IgG 2 alone or 
in complex with either gpl20 LAI or V3 loop-deleted gpl20 JR . Flj 

20 (Figure 7b and data not shown) . No binding was observed to 
an analogous CCR5 Nt phosphopept ide (2-18 (P) f Figure 12) by 
any of the gpl2 0/CD4 - lgG 2 complexes (Figure 7b) . Identical 
patterns of reactivity were observed for gpl20s in complex 
with CD4~y2, a divalent CD4- immunoglobulin fusion protein 

25 [data not shown, (1)] However, no binding was observed for 
gpl20 in complex with anti-gp!20 MAbs 2G12 and IgGlbl2, 
even though the latter' s epitope overlaps the CD4 binding 
site on gpl20 (data not shown) . Thus the ELISA reproduces 
the critical biological features of cell-surface CCR5/gpl20 

30 interactions, including a dependence upon CCR5 tyrosine 
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sulfation, CD4 , the V3 loop, and the coreceptor usage- 
patterns of the parent viruses . 



Using a competition ELISA, inhibition of gpl 2 0jr.pl/CD4 -IgG2 
5 binding to the sulfopeptide 2-18 was enabled with the anti- 
CCR5 MAb PA8 . However, binding of soluble gpl2 0/CD4-IgG 2 
complexes to the sulfopeptide was not inhibited by TAK-779, 
nor the CC-chemokines MlP-la, MIP-lp and RANTES even when 
used at supraphysiologic concentrations. 

10 

Binding of CCR5 Nt peptides to anti-CCR5 MAbs and soluble 
gpl20/CD4 : EL.ISA was used to test the binding of a panel of 
anti-CCR5 MAbs to peptides 2-18 and 2-18 (P) . We had 
previously demonstrated that MAbs PA8 , PA11 and PA12 bind 

15 epitopes in the Nt, PA10 binds an epitope that spans the Nt 
and ECL2 and PA14 binds an epitope exclusively in ECL.2 
[32]. Here we show that . PA8 avidly binds peptides 2-18 and 
2-18 (P) (Figure 2). PA10 binds avidly to 2-18 and 
moderately to 2-18 (P). PA11 binds moderately to 2-18 and 

20 weakly to 2-18(P). PA12's binding is weak for 2-18 and 
undetectable for 2-18 (P) . Finally, PA14 does not recognize 
either the sulfopeptide or the phosphopept ide (Figure 8) . 
Furthermore, PA 8 binds similarly to all of the CCR5 Nt- 
based sulf opept ides in Figure 12 (data not shown) . 

25 

In order to more precisely delineate the minimal CCR5 Nt 
domain that specifically binds to soluble gpl20/CD4 
complexes, a panel of sulf opeptides spanning different 
stretches of the CCR5 Nt were synthesized (Figure 12) . All 
30 of the peptides carried sulf otyrosines in positions 10 and 
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14 since we previously showed that these are required for 
complex-peptide binding. Binding of gpl 2 0jp.pl/CD4 - IgG 2 to the 
different sulf opeptides was tested by ELISA . Although the 
strongest binding was observed using longest sulf opeptide, 
5 ' 2-18, significant binding for peptide 10-18, which 
demonstrated -3 -fold lower avidity was also observed 
(Figure 9) . Peptides 8-15, 6-16 and 10-15 bound the soluble 
complex at least ten-fold less avidly than 2-18. (It was 
previously shown that a sulfopeptide consisting of residues 
10 10-14 did not bind soluble gpl20/CD4 complexes.) 
Furthermore, the gpl2 0 JR _ rL /CD4-IgG2 complex only weakly 
bound to peptide 10-18 carrying two alanine mutations in 
positions 11 and 18. Previous mutagenesis studies have 
shown that residues Asp-11 and Glu-18 are important for 
15 fusion, entry and gpl20-CCR5 binding [12, 13] . Finally, it 
should be noted that the same binding patterns to the 
different sulf opeptides were observed with soluble 
complexes containing gpl20 DH123 (data not shown) . 

Inhibition of gpl20/CD4 binding to CCR5 Nt sulf ooeot ides by 

_anti-gpl20 MAbs: In order to determine which domains of 

gpl20 were involved in binding to CCR5 Nt-based 
sulf opeptides, the ability of a panel of well -characterized 
anti-gpl20 MAbs [25] to inhibit binding of either the 
gpl20 JR . FL /CD4-IgG2 or the gpl2C DH123 /CD4 - IgG2 complex to the 
2-18 sulfopeptide was tested. Only MAbs directed against 
CD4- induced (CD4i) epitopes and the V3 loop were capable of 
inhibiting binding of the gpl20/CD4- IgG2 complex to the 
CCR5 sulfopeptide. Inhibition of gpl20 JR . FL and gpl20 DH123 
binding by MAbs 17b and 48d was >90%. The anti-V3 loop MAb 
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19b [28] , which recognizes an epitope in the crown of the 

V3 loop (sequence -I G--FY-T) and is reactive with R5 

strains, inhibited gpl20jR.PL binding >90% and gpl20 DH123 
binding by approximately 80%. Anti-V3 loop MAb PA1 , which 
5 was raised against gpl20jR.pL (W.C.O., unpublished results) 
efficiently inhibited binding of gpl2 0 JR . FIj but not gpl20 DH123 . 
Finally, anti-V3 loop MAb 9284 [21] , which recognizes an 
epitope spanning residue 307 to 330 in the V3 loop of X4 
strains, was unable to inhibit binding of either gp!20 

10 protein to the sulf opeptide . MAbs directed against other 
epitopes in other constant and variable .regions of gpl20 
also had no effect on binding of the soluble complex to the 
peptides. Similar results were obtained when the anti-gpl20 
MAbs were used to inhibit soluble complex binding to cell 

15 surface CCR5 (data not shown) . 

Binding of mutant soluble gp12Q/CD 4 complexes to CCR5 — Nt 

sulf o peptides : Numerous studies have shown that residues in 
the V3 loop determine co-receptor usage and binding [6-10, 

20 18, 20-21, 23-24, 27, 29, 33, 41-44, 46]. The crystal 
structure of a gpl2 0 lacking the V1/V2 and V3 loops in 
complex with sCD4 and the 17b MAb further implicated a 
conserved, CD4i surface on gpl20, adjacent to the V3 loop, 
in co-receptor binding [36, 37] . Single alanine mutants of 

25 all of the residues near or within regions previously shown 
to be important for co-receptor usage were generated. These 
gpl20 mutants were tested for their ability to bind to the 
CCR5-based sulfopeptide 2-18 as well as to cell surface 
CCR5 . Binding was normalized for gp!20 mutant binding to 

30 CD4-lgG 2 . Wild-type levels of binding were observed for all 
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mutants except W4 2 7A, R4 4 0A and R4 6 9A, which bound CD4-IgG 2 
with 5-10-fold lower but nonetheless measurable avidity. 



Residues in both strands of the V3 loop stem were found to 
5 be involved in gpl20 binding to the 2-18 sulf opept ide : 
Alanine mutants of residues R298, N301, T303, 1322, D324, 
132 5 and R3 2 6 were found to decrease complex binding to the 
peptide by >10-fold- Residues in the crown of the V3 loop, 
including the GPGR motif, however, had no effect on gpl20 

10 binding to the sulf opept ide . C4 residues in or adjacent to 
the two C-terminal (3-strands of the bridging sheet were 
also shown to participate in binding to the sulf opept ide : 
..Alanine substitutions of R419, 1420, K421, Q422 and R444 
decreased complex binding to the sulfopeptide by 5-10-fold. 

15 None of the alanine substitutions that we introduced in the 
other regions of gpl20 significantly affected complex- 
peptide-interactions . 

It was furthermore demonstrated that additional gpl2 0 
20 residues are involved in complex binding to cell surface 
expressed CCR5 . Alanine substitutions of residues S306, 
G310, P311, R313, F315, Y316 in the crown of the V3 loop 
decreased complex binding to CCR5 by 5-10-fold. 
Furthermore, alanine substitutions of several residues in 
25 CI, C2 and C3 also had a moderate effect on complex binding 
to CCR5 . Finally, it is noted that alanine substitutions of 
R44 0 and R469 increased complex binding to both 2-18 and 
CCR5, whereas substitutions of E320 and W427 increased 
complex binding to CCR5 only. 
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Discussion 

Tyrosine- sulfated CCR5 Nt peptides were studied for binding 
to soluble gpl2 0/CD4 complexes as well as anti-CCR5 MAbs, 
CO chemokines and TAK-779 using a novel solid-phase ELISA. 
5 Inhibition of peptide-complex interactions by anti-gpl20 
MAbs was explored by surface plasmon resonance. These Mabs 
were also tested for their ability to inhibit complex 
binding to cell surface CCR5 . In addition, a panel of 
gp!20 point mutants were generated and then their 

10 reactivity was compared with CCR5 Nt peptides and cell 
surface CCR5 . The principal conclusions are that (1) 
residues 10 to 18 of the CCR5 Nt may define the minimum 
recognition site for gpl20, (2) gpl20 binding to the CCR5 
Nt depends on highly conserved residues located in the C4 

15 domain and the stem of the V3 loop, and (3) gpl2 0 binding 
to cell surface CCR5 depends on a broader region that 
includes residues in the crown of the V3 loop, CI, C2 and 
C3 . The findings suggest that distinct domains of gpl20 
and CCR5 bind in a multi-step fashion and raise questions 

20 about the determinants of specificity of the co-receptor- 
gpl2 0 interaction . 

An ELI SA was developed to detect complex-peptide binding 
based on the observation that the tetravalent gpl20/CD4 - IgG a 

25 complex binds to CCR5 Nt sulf opeptides ten-to a hundred - 
fold more avidly than the monovalent gpl20/sCD4 complex. 
Complex-sulf opeptide binding was only observed with gpl20 
proteins derived from R5 and R5X4 , but not X4 HIV-1 
strains. V3 loop deleted gpl20 JR _ FL failed to bind to the 

30 sulf opeptides . Phosphopept ides did not bind to any of the 
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soluble complexes. Thus, the EL ISA reproduces the critical 
biological features of cell-surface CCR5-gpl20 

interactions, including a dependence upon CD4 , CCR5 
tyrosine sulfation, the V3 loop and the co-receptor usage 
5 patterns of the parental viruses. 

CCR5 Nt phosphopept ides and sulf opept ides were 
differentially recognized by anti-CCR5 MAbs in ELISA. PA 8 
possessed equal avidity for sulfated and phophorylated 

10 peptides, implying that its epitope does not include 
tyrosine side chains. PA10 and PA11 preferentially 

recognized the sulf cpept ide , albeit with varying 
efficiencies, suggesting that sulf otyrosines participate 
either directly in peptide-MAb interactions or indirectly 

15 by influencing epitope conformations. PA12 only interacted 
with the sulfopeptide and PA14 did not bind either Nt 
peptide. Ir was previously shown that both of these MAbs 
recognize discontinuous epitopes comprising residues in the 
Nt and ECLi2 of CCR5 . The observations now imply that ECL2 

20 residues are marginal for PA12 binding and essential for 
PA14 binding to CCR5 . Finally, binding of soluble 

gpl20/CD4 complexes to CCR5 Nt peptides couls be completed 
with an ant i -CCR5 Mab but not with either CC-Chemokines or 
TAK-77S, whose binding sites have been mapped to other 

25 regions of CCR5 (14, 39) . Both CC-chemokines and TAK-779, 
however, are able to compete with gp!20/CD4 binding to cell 
surface CCR5, perhaps through steric or comf ormat ional 
mechanisms. (14, 42, 46). It is noted that Farzan et al . 
reported that a CCR5 Nt sulfopeptide spanning residues 1-22 

30 partially blocks MlP-la binding to cell- surface CCR5 and we 
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attribute the discrepancy to differences in peptides and 
assays (17) . 



In order to more precisely delineate the gpl20 binding site 
5 in the CCR5 Nt , an ELI SA was used to test binding of 
soluble complexes to several CCR5 Nt sulf opept ides . 10-18 
was the smallest sulfopeptide that avidly bound soluble 
gpl20/CD4 complexes and may define the minimum docking site 
for gpl20 on CCR5 . In addition to the two sulf otyrosines 

10 in positions 10 and 14, negatively charged amino acids Dll 
and E18 were found to be critical for complex-peptide 
binding. It was concluded that a cluster of negative 
charges in the CCR5 Nt appears to represent the principal 
recognition motif for gpl20, although residues 2 to 9 

15 further contribute to binding. Similar patterns of peptide 
reactivity were observed for recombinant gpl20s derived 
from H1V-1jr.pl (R5) and HIV-1 DH123 (R5X4) , suggesting that the 
CCR5 Nt sulf opeptides recognize conserved structures in the 
envelope glycoprotein. Gpl2 0 DH123 , however, bound about 

20 five- fold less that gpl20 JR . FL to the sulf opeptides , which 
probably accounts for its less efficient usage of CCR5 
(13) . 



Anti-gpl20 MAbs were tested for their ability to inhibit 
25 gpl20/CD4 binding to sul f opept ides or to cell surface CCR5 . 
A number of anti-gp!20 MAbs directed against conserved and 
variable regions of the envelope glycoprotein were not 
inhibitory. Only Mabs 48d and 17b, directed against CD4i 
epitopes, and 19b and PA1 , directed against the V3 loop, 
30 efficiently inhibited gpl20 binding to the 2-18 



sulfopeptide and to cell surface CCR5 . The CD4i epitope 
was previously shown to participate in co- receptor binding 
and residues in the V3 loop primarily determine co-receptor 
specificity (36, 37) . The results now suggest that these 
regions of gpl20 determine its association with the CCR5 
Nt . It is noted that inhibition of peptide -complex binding 
by 19b, which recognizes an epitope in the V3 crown, is 
inconsistant with the finding by gpl20 mutagenesis 
experiments that residues in the V3 loop crown do not 
participate in complex-peptide binding. This leads to a 
conclusion that the inhibitory effect of 19b may be steric 
hindrance . 

In order to determine more precisely the. regions of gp!20 
that modulate the gpl20-CCR5 itneraction, the binding of a 
panel of gpl20 point mutants to the CCR5 Nt sulfopeptide 
and to cell surface CCR5 was tested. The mutants were 
created by the introduction of single alanine substitutions 
near or within regions previously shown to be important for 
the integrity of the CD4i epitope and/or CCR5 binding (36, 
37) . Highly conserved residues in C4 and the V3 loop stem, 
including for arginines and a lysine, were found to affect 
binding of gpl2 0 to the CCR5 Nt sulfopeptide (Figure 13) . 
These residues are located in two random coil segments of 
C4 that straddle the V3 loop stem and may constitute a 
positively charged CCR5 Nt binding domain (22) 
Additional, conserved residues in the crown of the V3 loop, 
CI, C2 and C3 contribute to gp!20 binding to cell surface 
CCR5 (Figure 13) . It remains to be determined whether 
these residues interact with other extracellular domains of 
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CCR5 or whether they influence the conformation of C4 and 
the V3 loop stem in a way that is only relevant in the 
context of gpl20/CD4 binding to cell surface CCR5 . It is 
unlikely that these residues also interact with the Nt in 
5 the context of cell surface CCR5 because they are 
relatively distal from the C4 and V3 residues that were 
implicated in sulfopeptide binding (22) . 



10 



To date, several lines of evidence suggest that gpl20 binds 
to more that one region of the CCR5 co-receptor: (1) the 
affinity of gpl20s/CD4 for the CCR5 Nt sulfopeptide is 
approximately 10-100 -fold lower that for the native, 
membrane-associated co-receptor (11, 42, 46), (2) co- 
receptor chimera studies implicate the extracellular loops 
15 in viral fusion and entry (2, 12, 34, 38) and (3) 
inhibitors of CCR5 co-receptor function such as Mabs 2D7 
and PA14, as well as TAK-779 do not bind to the CCR5 Nt yet 
block gp!20/CD4 binding to CCR5 (14, 32). The present 
findings could be interpreted to support a distributed 
20 model for gpl2 0-CCR5 interactions that mirrors the two-site 
paradigm proposed for the interaction of certain chemokines 
with their receptors (30, 45) . In this model, binding is 
initially driven by electrostatic interactions between 
negatively charged residues in the receptor Nt and basic 
surfaces on the chemokine ligand. This binding serves to 
orient the ligand and promote its interactions with other 
portions of the chemokine receptor. The V3 loop crown may 
form initial electrostatic interactions with the 
extracellular loops of CCR5 , which would allow the CCR5 Nt 
to bind to a conserved region of gpl20 comprising residues 
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in C4 and the V3 loop stem. Alternatively , the CCR5 Nt 
could first bind the C4/V3 stem domain, which would them 
promote an interaction of the V3 loop with some other 
region of CCR5 . All of these interactions involve 

additional gpl20 residues that we have yet to identify. 
The role of the putative second interaction is unclear but 
it may further stabilize the gp!20-CCR5 interaction, 
optimally orienting the fusion apparatus, or triggering 
gp41 conformational changes that are required for fusion. 



30 



The findings present us with a seeming paradox wherein nine 
residues of the CCR5 Nt confer specificity on the CCR5- 
gpl20 interaction by binding to gpl20 residues that are 
highly conserved among clade B isolates, regardless of 
15 their co-receptor usage. However, although the C4 and V3 
stem residues themselves are conserved, their precise 
placement may differ for R5 and X4 viruses. Clearly, 
relatively minor differences in the orientation, exposure 
or relative positioning of these widely separated residues 
could abrogate binding to a short peptide but not a MAb 
(e.g., 17b) possessing a larger, more distributed binding 
site (37). In addition, more variable amino acids (e.g., 
324) within or outside the C4/V3 loop stem may contribute 
to the specificity of the gpl20-Nt interaction, and we 
showed that residues N279, R313, P369 and R444 participate 
in gpl20/CD4 binding to cell surface CCR5 but not to CCR5 
Nt sulfopeptides. Future studies employing additional 
gpl20 mutants together with CCR5 mutants and CXCR4 -based 
sulfopeptides will shed light on the specificity 
determinants of the gpl20-co-receptor interaction. 
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CD4 and CCR5 mediate fusion and entey of R5 HIV-1 strains. 
Sulfotyrosines and negatively charged residues in the CCR5 
Nt are crucial for binding of gpl20 and viral antry. 
5 Soluble gpl20/CD4 complexes specifically bind to CCR5 Nt 
peptides containing sulf otyrsinces in positions 10 and 14. 
CCR5 Nt sulfotyrosines inhibit gpl20/CD4 binding to CCR5 as 
well as viral entry. Residues in the V3 loop and the C4 
region og gpl20 compose a binding site for the CCR5 amino 

10 terminal domain. Redisues 10-18 of the CCR5 Nt constitute a 
minimal binding domain for gp!20: sulfotyrosines Y10 and 
Y14 and negatively charged residues Dll and E18 are 
important for binding. The CCR5 Nt. terninal binding site on 
gpl2 0 is composed mostly of residues in the V3 loop stem 

15 and the C4 domain. 
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is claimed: 

A compound comprising the structure: 
OaYDINYYTSEpA 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein 3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein 9 represents an amino group or an acetylated 
amine group; wherein X represents a carboxyl group or 
an ami as ted carboxyl group; 

wherein all of a , Y,D, I , N, Y, Y, T, S , E and 3 are joined 
together by peptide bonds; 

further pi~ovided that at least two tyrosines in the 
compound are sulfated. 
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2. The compound of claim 1, wherein p represents less 
than 17 amino acids. 

3. The compound of claim 1, wherein |3 represents less 
5 than 16 amino acids. 

4. The compound of claim 1, wherein [3 represents less 
than 15 amino acids. 

10 5. The compound of claim 1, wherein 3 represents less 
than 14 amino acids. 

6. The compound of claim 1, wherein 3 represents less 
than 13 amino acids. 

15 

7. The compound of claim l, wherein 3 represents less 
than 12 amino acids. 

8. The compound of claim 1, wherein 3 represents less 
20 than 11 amino acids. 

9. The compound of claim 1, wherein [3 represents less 
than 10 amino acids. 

25 10. The compound of claim 1, wherein [3 represents less 
than 9 amino acids . 

11. The compound of claim 1, wherein [3 represents less 
than 8 amino acids. 

30 
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12. The compound of claim 1, wherein (3 represents less 
than 7 amino acids . 

13. The compound of claim 1, wherein (3 represents less 
5 than 6 amino acids . 

14. The compound of claim 1, wherein (3 represents less 
than 5 amino acids. 

10 15. The compound of claim 1, wherein (3 represents less 
than 4 amino acids. 

16. The compound of claim 1, wherein (3 represents less 
than 3 amino acids. 

15 

17. The compound of claim 1, wherein p represents less 
than 2 amino acids . 

18. The compound of claim 1, wherein (3 represents less 
20 than 1 amino acid. 

19. The compound of claim 1, wherein a represents less 
than 9 amino acids. 

25 20. The compound of claim 1, wherein a represents less 
than 8 amino acids. 

21. The compound of claim 1, wherein a represents less 
than 7 amino acids. 
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22. The compound of claim 1, wherein a represents less 
than 6 amino acids. 

23. The compound of claim 1, wherein a represents less 
5 than 5 amino acids. 

24. The compound of claim 1, wherein a represents less 
than 4 amino acids . 

10 25. The compound of claim 1, wherein a represents less 
than 3 amino acids. 

26. The compound of claim 1, wherein a represents less 
than 2 amino acids. 

15 

27. The compound of claim 1, wherein a represents less 
than 1 amino acid. 

28. A composition comprising the compound of claim 1 and a 
20 detectable marker attached thereto. 

29. The composition of claim 28, wherein the detectable 
marker is biotin. 

25 30. The composition of claim 28, wherein the detectable 
marker is attached at the C- terminus of the compound. 

31. A composition which comprises a carrier and an amount 
of the compound of claim 1 effective to inhibit 
30 binding of H1V-1 to a CCR5 receptor on the surface of 
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a CD4 + cell. 

A method of inhibiting human immunodeficiency virus 
infection of a CD4 + cell which also carries a CCR5 
receptor on its surface which comprises contacting the 
CD4 + cell with an amount of the compound of claim 1 
effective to inhibit binding of human immunodeficiency 
virus to the CCR5 receptor so as to thereby inhibit 
human immunodeficiency virus infection of the CD4 + 
cell. 

The method of claim 32, wherein the CD4+ cell is 
present in a subject and the contacting is effected by 
administering the compound to the subject. 

A method of preventing CD4+ cells of a subject from 
becoming infected with human immunodeficiency virus 
which comprises administering to the subject an amount 
of the compound of :;laim 1 effective to inhibit 
binding of human :- aunodef iciency virus to CCR5 
receptors on the surface of the CD4+ cells so as to 
thereby prevent the subject's CD4+ cells from becoming 
infected with human immunodeficiency virus. 

A method of treating a subject whose CD4+ cells are 
infected with human immunodeficiency virus which 
comprises administering to the subject an amount of 
the compound of claim 1 effective to inhibit binding 
of human immunodeficiency virus to CCR5 receptors on 
the surface of the subject's CD4+ cells so as to 
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thereby treat the subject. 

The method of any one of claims 33-35, wherein the 
compound is administered by aerosol, intravenous, oral 
or topical route. 



37. The method of claim 33 or 35, wherein the subject is 
infected with HIV-1 prior to administering the 
compound to the subject. 

10 

38. The method of claim 33 or 34, wherein the subject is 
not infected with HIV-1 prior to administering the 
compound to the subject. 

15 39. The method of claim 38, wherein the subject is not 
infected with, but has been . exposed to, human 
immunodeficiency virus. 

40. The method of any one of claims 33-35, wherein the 
20 effective amount of the compound comprises from about 

1.0 ng/kg to about 100 mg/kg body weight of the 
sub j ect . 

41. The method of claim 40, wherein the effective amount 
25 of the compound comprises from about 100 ng/kg to 

about 50 mg/kg body weight of the subject. 
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The method of claim 41, wherein the effective amount 
of the compound comprises from about 1 /ig/kg to about 
10 mg/kg body weight of the subject. 



The method of claim 42, wherein the effective amount 
of the compound comprises from about 10 0 M9/kg to 
about 1 mg/kg body weight of the subject. 

The method of any one of claims 33-35, wherein the 
subject is a human being. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) immobilizing the compound of claim 1 on a solid 
support ; 

(b) contacting the immobilized compound from step (a) 
with sufficient detectable CCR5 ligand to 
saturate all binding sites for the CCR5 ligand on 
the immobilized compound under conditions 
permitting binding of the CCR5 ligand to the 
immobilized compound so as to form a complex; 

(c) removing any unbound CCR5 ligand; 

(d) contacting the complex from step (b) with the 
agent ; and 

(e) detecting whether any CCR5 ligand is displaced 
from the complex, wherein displacement of 
detectable CCR5 ligand from the complex indicates 
that the agent binds to the compound so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
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comprises : 

(a) contacting the compound of claim 1 with 
sufficient detectable CCR5 ligand to saturate all 
binding sites for the CCR5 ligand on the compound 

5 under conditions permitting binding of the CCR5 

ligand to the compound so as to form a complex ; 

(b) removing any unbound CCR5 ligand; 

(c) measuring the amount of CCR5 ligand which is 
bound to the compound in the complex; 

10 (d) contacting the complex from step (a) with the 

agent so as to displace CCR5 ligand from the 
complex; 

(e) measuring the amount of CCR5 ligand which is 
bound to the compound in the presence of the 

15 agent; and 

(f) comparing the amount of CCR5 ligand bound to the 
compound in step (e) with the amount measured in 
step (c) , wherein a reduced amount measured in 
step (e) indicates that the agent binds to the 

20 compound so as to thereby identify the agent as 

one which inhibits binding of the CCR5 ligand to 
the CCR5 receptor. 



47. A method of identifying an agent which inhibits 
25 binding of a CCR5 ligand to a CCR5 receptor which 

comprises : 

(a) immobilizing the compound of claim 1 on on a 
solid support; 

(b) contacting the immobilized compound from step (a) 
30 with the agent and detectable CCR5 ligand under 
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conditions permitting binding of the CCR5 ligand 
to the immobilized compound so as to form a 
complex ; 

(c) removing any unbound CCR5 ligand ; 

(d) measuring the amount of detectable CCR5 ligand 
which is bound to the immobilized compound in the 
complex; 

(e) measuring the amount of detectable CCR5 ligand 
which binds to the immobilized compound in the 
absence of the agent ; 

(f ) comparing the amount of CCR5 * ligand which is 
bound to the immobilized compound in step (e) 
with the amount measured in step (d) , wherein a 
reduced amount measured in step (d) indicates 
that the agent binds to the compound or CCR5 
ligand so as to thereby identify the agent as one 
which inhibits binding of the CCR5 ligand to the 
CCR5 receptor. 

The method of claim 47, wherein the amount of the 
detectable ligand in step (a) and step (e) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 

A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

(a) contacting the compound of claim 1 with the agent 
and detectable CCR5 ligand under conditions 
permitting binding of the CCR5 ligand to the 



compound so as to form a complex; 

(b) removing any unbound CCR5 ligand ; 

(c) measuring the amount of detectable CCR5 ligand 
which is bound to the compound in the complex; 

(d) measuring the amount of detectable CCR5 ligand 
which binds to the compound in the absence of the 
agent ; 

(e) comparing the amount of CCR5 ligand which is 
bound to the compound in step (c) with the amount 
measured in step (d) , wherein a reduced amount 
measured in step (c) indicates that the agent 
binds to the compound or CCR5 ligand so as to 
thereby identify the agent as one which inhibits 
binding of the CCR5 ligand to the CCR5 receptor. 

The method of claim 49, wherein the amount of the 
detectable ligand in step (a) and step (d) is 
sufficient to saturate all binding sites for the CCR5 
ligand on the compound. 



The method of any one of 
detectable CCR5 ligand is 
marker . 



claims 45-50, wherein the 
labeled with a detectable 



A method of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor which 
comprises : 

a) immobilizing the compound of claim 1 on a solid 
support ; 

b) contacting the immobilized compound from step a) 



with the agent dissolved or suspended in a known 
vehicle and measuring the binding signal 
generated by such contact; 

c) contacting the immobilized compound from step a) 
with the known vehicle in the absence of the 
compound and measuring the binding signal 
generated by such contact; 

d) comparing the binding signal measured in step b) 
with the binding signal measured in step c) , 
wherein an increased amount measured in step b) 
indicates that the agent binds to the compound so 
as to thereby identify the agent as one which 
binds to the CCR5 receptor. 

The method of claim 52, wherein the solid support is a 
surface plasmon resonance sensor chip. 

The method of claim 52 or 53, wherein the binding 
signal is measured by surface plasmon resonance. 

A method of obtaining a composition which comprises: 

(a) identifying a compound which inhibits binding of 
a CCR5 ligand to a CCR5 receptor according to the 
method of any one of claims 45-50 and 52; and 

(b) admixing the compound so identified or a homolog 
or derivative thereof with a carrier. 

The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a complex comprising an HIV-1 
envelope glycoprotein and a CD4 -based protein. 
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The method of claim 56, wherein the HIV-1 envelope 
glycoprotein is gpl20, gpl40 or gpl60. 

The method of claim 56, wherein the CD4-based protein 
is soluble CD4 or CD4-IgG2. 

The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is a chemokine . 

The method of claim 59, wherein the chemokine is 
RANTES, MlP-la or MIP-1(5. 

The method of any one of claims 45-50 and 52, wherein 
the CCR5 ligand is an antibody. 

The method of claim 61, wherein the antibody is 
selected from the group consisting of PA 8 (ATCC 
Accession No. HB-12605) , PA10 (ATCC Accession 
No. 12607), PA11 (ATCC Accession No. HB-12608) , PA12 
(ATCC Accession No. HB-12609) . 

The method of claim 45 or 47, wherein the solid 
support is a microtiter plate well, a bead or surface 
plasmon resonance sensor chip. 

A compound having the structure : 

A- (aYDINYYTSEpA) n 
wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
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acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
I at position 9 and extending therefrom in the amino 
terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein X represents a carboxyl group or an amidated 
carboxyl group ; 

wherein all of a,Y,D, I,N,Y,Y,T,S,E and p are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein n is an integer from 1 to 8 , A is a polymer, 
and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 

A compound having the structure: 
(BaYDINYYTSEP) n -A 

wherein each T represents a threonine, each S 
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represents a serine, each E represents a glutamic 
acid, each Y represents a tyrosine; each D represents 
an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine,- 
5 wherein a represents from 0 to 9 amino acids, with the 

proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
consecutive order and have a sequence identical to the 
sequence set forth in SEQ ID NO: 1 beginning with the 
10 I at position 9 and extending therefrom in the amino 

terminal direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined together by peptide bonds in 
15 consecutive order and have a sequence identical to the 

sequence set forth in SEQ ID NO: 1 beginning with the 
P at position 19 and extending therefrom in the 
carboxy terminal direction; 

wherein 0 represents an amino group or an acetylated 
20 amino group; 

wherein all of a , Y , D , I , N, Y , Y , T, S , E and (3 are joined 
together by peptide bonds, 

further provided that at least two tyrosines in the 
compound are sulfated, 
25 wherein n is an integer from 1 to 8, A is a polymer, 

and the solid line represents up to 8 linkers which 
attach the structure in parentheses to A. 



30 



66 . 



The compound of claim 64 or 65, wherein the polymer is 
selected from the group consisting of a linear lysine 



polymer, a branched lysine polymer, a linear arginine 
polymer, a branched arginine polymer, polyethylene 
glycol, a linear acetylated lysine polymer, a branched 
acetylated lysine polymer, a linear chloroacetylated 
lysine polymer and a branched chloroacetylated lysine 
polymer . 

The compound of claim 1, wherein the compound is a 
peptide which comprises consecutive amino acids having 
the sequence YDINYYTSE . 

The compound of claim 67, wherein the tyrosines at 
positions 1 and 5 of the sequence YDINYYTSE are 
sulfated . 

A compound comprising the structure: 

9aYDnnYnnnE(3A 

wherein each E represents a glutamic acid, and each Y 
represents a tyrosine ; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

wherein (3 represents from 0 to 13 amino acids, with 
the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 



forth in SEQ ID NO: 1 beginning with the P at position 
19 and extending therefrom in the carboxy terminal 
direction; 

wherein 6 represents an amino group or an acetylated 
5 amino group; wherein A represents a carboxyl group or 

an amidated carboxyl group; 
wherein n represents any amino acid, 

wherein all of a, Y , D, IT , n , Y, n , n , n , E and (3 are joined 
together by peptide bonds ; 
10 further provided that at least two tyrosines in the 

compound are sulfated. 

70. The compound of claim 69, wherein the compound is a 
peptide which comprises consecutive amino acids have 

15 the sequence YDnnYniUlE . 

71. The compound of claim 70, wherein the tyrosines at 
positions 1 and 5 of the sequence YDnnYnnnE are 
sulfated . 

20 

72. A compound comprising the structure: 
6aYDINYYTSE(3X 

wherein each T represents a threonine, each S 
represents a serine, each E represents a glutamic 
25 acid, each Y represents a tyrosine; each D represents 

an aspartic acid, each I represents an isoleucine; and 
each N represents an asparagine; 

wherein a represents from 0 to 9 amino acids, with the 
proviso that if there are more than 2 amino acids, 
30 they are joined by peptide bonds in consecutive order 
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and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the I at position 
9 and extending therefrom in the amino terminal 
direction; 

5 wherein 3 represents from 0 to 13 amino acids, with 

the proviso that if there are more than 2 amino acids, 
they are joined by peptide bonds in consecutive order 
and have a sequence identical to the sequence set 
forth in SEQ ID NO: 1 beginning with the P at position 
10 19 and extending therefrom in the carboxy terminal 

direction; 

wherein 0 represents an amino group or an acetylated 
amino group; wherein A represents a carboxyl group or 
an amidated carboxyl group; 
15 wherein all of a , Y, D , I , N, Y , Y, T , S , E and |3 are joined 

together by peptide bonds ; 

further provided that at least two tyrosines in the 
compound are sulfated, 

wherein any amino acid except for the Y at position 1, 
20 D at position 2, Y at position 5 and E at position 9 

may be replaced with a homologous amino acid, 

73. The compound of claim 72, wherein any I amino acid 
residue is be replaced with a G,A,V or L amino acid 

25 residue. 

74. The compound of claim 72, wherein any N amino acid 
residue is replaced with a Q amino acid residue. 



30 75. The compound of claim 72, wherein any Y amino acid 



-133- 

residue is replaced with an F or W amino acid residue. 

76. The compound of claim 72, wherein any T amino acid 
residue is replaced with an S amino acid residue. 

5 

77. The compound of claim 72, wherein any S is replaced 
with a T amino acid residue. 

78. The compound of claim 72, wherein any C is replaced 
10 with an M, S, T, A, G, N, or Q amino acid residue. 
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SEQUENCE LISTING 

<110> Progenies Pharmaceuticals, Inc., et al. 

<120> SULFATED CCR5 PEPTIDES FOR HTV-1 INFECTION 

<130> 2048/61 01 0-A-PCT/JPW/SHS/AX 

<140> NOT YET KNOWN 

<141> 2001-02-28 

<160> 17 

<170> Patentln version 3.0 

i 

<210> 1 
<211> 352 
<212> PRT 
<213> human 
<400> 1 

Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 
15 10 15 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

Thr Asp ne Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Thr Val Pro Phe Tip Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 



Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe lie Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thx Phe 
130 135 140 

Gly Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Thr Cys Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Trp Lys Asn 
180 185 190 

Phe Gin Thr Leu Lys He Val He Leu Gly Leu Val Leu Pro Leu Leu 
195 200 205 

Val Met Val He Cys Tyr Ser Gly He Leu Lys Thr Leu Leu Arg Cys 
210 215 220 

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 
225 230 235 240 

Met He Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn He Val Leu Leu 
245 250 255 

Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 265 270 

Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 
275 280 285 

His Cys Cys He Asn Pro He lie Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 

Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He AJa Lys Arg Phe 
305 310 315 320 



Cys Lys Cys Cys Ser lie Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 
325 330 335 



Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 



<210> 2 
<211> 1376 
<212> DNA 
<213> human 

<400> 2 

gaattccccc aacagagcca agctctccat ctastggaca gggaagctag cagcaaacct 60 

I 

tcccttcact acaaaacttc attgcttggc caaaaagaga gttaattcaa tgtagacatc 120 
tatgtaggca attaaaaacc tattgatgta taaaacagtt tgcattcatg gagggcaact ISO 
aaatacattc taggacttta taaaagatca ctttttattt atgcacaggg tggaacaaga 240 
tggattatca agtgtcaagt ccaatctatg acatcaatta ttatacatcg gagccctgcc 300 
aaaaaatcaa tgtgaagcaa atcgcagccc gcctcctgcc tccgctctac tcactggtgt 360 
tcatctttgg ttttgtgggc aacatgctgg tcatcctcat cctgataaac tgcaaaaggc 420 
tgaagagcat gactgacatc tacctgctca acctggccat ctctgacctg tttttccttc 480 
ttactgtccc cttctgggct cactatgctg ccgcccagtg ggactttgga aatacaatgt 540 
gtcaactctt gacagggctc tattttatag gcttcttctc tggaatcttc ttcatcatcc 600 
tcctgacaat cgataggtac ctggctgtcg tccatgctgt gtttgcttta aaagccagga 660 
cggtcacctt tggggtggtg acaagtgtga tcacttgggt ggtggctgtg ttlgcgtctc 720 
tcccaggaat catctttacc agatctcaaa aagaaggtct tcattacacc tgcagctctc 780 
attttccata cagtcagtat caattctgga agaatttcca gacattaaag atagtcatct 840 
tggggctggt cctgccgctg cttgtcatgg tcatctgcta ctcgggaatc ctaaaaactc 900 



tgcttcggtg tcgaaatgag aagaagaggc acagggctgt gaggcttatc ttcaccatca 960 
tgattgttta ttttctcttc tgggctccct acaacattgt ccttctcctg aacaccttcc 1 020 
aggaattctt tggcctgaat aattgcagta gctctaacag gttggaccaa gctatgcagg 1080 
tgacagagac tcttgggatg acgcactgct gcatcaaccc catcatctat gcctttgtcg 1 140 
gggagaagtt cagaaactac ctcttagtct tcttccaaaa gcacattgcc aaacgcttct 1200 
gcaaatgctg ttctattttc cagcaagagg ctcccgagcg agcaagctca gtttacaccc 1260 
gatccactgg ggagcaggaa atatctgtgg gcttgtgaca cggactcaag tgggctggtg 1320 
acccagtcag agttgtgcac atggcttagt tttcatacac agcctgggct gggggt 1376 

<210> 3 
<211> 49 
<212> PRT 

<213> human immunodeficiency virus 
<400> 3 

Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
1 5 10 15 

Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He 
20 25 30 

Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp 
35 40 45 

Gin 



<210> 4 
<211> 35 



5 

<212> PRT 

<213> human immunodeficiency virus 
<400> 4 

Tip Met Glu Tip Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu 

35 , 

j 

<210> 5 

<211> 22 1 
<212> PRT 
<213> human 

<400> 5 

Leu Leu Thr Val Glu Gin Ala Leu Ala Asp Phe Ala Glu Leu Leu Arg 
15 10 15 

Ala Leu Arg Arg Asp Leu 
20 

<210> 6 
<211> 33 
<212> PRT 



6 

<213> human 
<400> 6 

His Met Lys Gin Leu Glu Asp Lys Val G]u Glu Leu Leu Ser Lys Asn 
1 5 10 15 

Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu 
20 25 30 

Arg 

<210> 7 
<211> 33 
<212> PRT 
<213> human 

<400> 7 

His Met Lys Gin He Glu Asp Lys He Glu Glu lie Leu Ser Lys lie 
1 5 10 15 

Tyr His He Glu Asn Glu He Ala Arg lie Lys Lys Leu lie Gly Glu 
20 25 30 

Val 

<210> 8 
<211> 40 
<212> PRT 
<213> human 



7 



<400> 8 

Leu Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Leu Glu Asp Glu Lys 
15 10 15 

Ser Ala Leu Gin Thr Glu He Ala Asn Leu Leu Lys Glu Lys Glu Lys 
20 25 30 

Leu Glu Phe He Leu Ala Ala Arg 
35 40 

<210> 9 

<211> 40 

<212> PRT ' 

i 

<213> human 



<400> 9 

His Met Arg Arg He Ala Arg Leu Glu Glu Lys Val Lys Thr Leu Lys 
15 10 15 

Ala Gin Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu Arg Glu Gin 
20 25 30 

Val Ala Gin Leu Lys Gin Lys Tyr 
35 40 



<210> 10 
<211> 36 
<212> PRT 
<213> unknown 



8 

<220> 

<221> PEPTIDE 
<222> (1)..(36) 
<223> T-20 

<400> 10 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Tip Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 

<210> 11 
<211> 38 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1).-(3S) 
<223> DP 107 



<400> 11 

Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Tip Gly He Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 

<210> 12 
<211> 34 

<212> PRT 1 

<213> unknown ■ 

i 

<220> 

<221> PEPTIDE 
<222> (1)..(34) 
<223> N34 

<400> 12 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Tin- Val Tip Gly He Lys Gin Leu Gin 
20 25 30 

Ala Axg 



# 



10 

<210> 13 
<211> 28 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(28) 
<223> C28 

<400> 13 

Trp Met Glu Tip Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys 
20 25 

<210> 14 

<211> 68 

<212> PRT 

<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(6S) 



11 

<223> N34(L6)C2S 
<400> 14 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Tip Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Ser Gly Gly Arg Gly Gly Tip Met Glu Trp Asp Arg Glu He 
35 40 45 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 
50 55 60 

i 

Gin Gin Glu Lys • 

65 ! 

<210> 15 
<2U> 39 
<212> PRT 
<213> unknown 

<220> 

<221> PEPTIDE 
<222> (1)..(39) 
<223> T1249 

<400> 15 

Tip Gin Glu Trp Glu Gin Lys He Thr Ala Leu Leu Glu Gin Ala Gin 
15 10 15 



12 

He Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Tip 
20 25 30 

Ala Ser Leu Trp Glu Tip Phe 
35 

<210> 16 
<211> 502 
<212> PRT 

<213> human immunodeficiency virus 



<400> 16 

Met Arg Val Lys Gly He Arg Lys Ser Tyr Gin Tyr Leu Trp Lys Gly 
15 10 15 

Gly Tlxi- Leu Leu Leu Gly He Leu Met He Cys Ser Ala Val Glu Lys 
20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Tip Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Glu Gin Met Gin Glu Asp He He Ser Leu Trp Asp 
100 105 110 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 
130 135 140 



13 

Thr Met Glu Arg Gly Glu He Lys Asn Cys Ser Phe Asn He Tlir Thr 
145 150 155 160 

Ser He Arg Asp Glu Val Gin Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 
165 170 175 

Asp Val Val Pro He Asp Asn Asn Asn Thr Ser Tyr Arg Leu He Ser 
180 185 190 

Cys Asp Tin- Ser Val He Thr Gin Ala Cys Pro Lys He Ser Phe Glu 
195 200 205 

Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala He Leu Lys 
210 215 220 

Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser 
225 230 235 240 

Thr Val Ghr Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin Leu 
245 250 255 

Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He Arg Ser Asp 
260 265 270 

Asn Phe Thr Asn Asn Ala Lys Thr He He Val Gin Leu Lys Glu Ser 
275 280 285 

Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He 
290 295 300 

His He Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu He He Gly 
305 310 315 320 

Asp He Arg Gin Ala His Cys Asn lie Ser Arg Ala Lys Trp Asn Asp 
325 330 335 

Thr Leu Lys Gin He Val He Lys Leu Arg Glu Gin Phe Glu Asn Lys 
340 345 350 

Thr He Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu He Val Met 
355 360 365 

His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin 
370 375 380 



14 



Leu Phe Asn Ser Thr Tip Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr 
385 390 395 400 

Glu Gly Asn Thr He Thr Leu Pro Cys Arg He Lys Gin Be lie Asn 
405 410 415 

Met Tip Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro He Arg Gly 
420 425 430 

Gin He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
435 440 445 

Gly Gly He Asn Glu Asn Gly Thr Glu He Phe Arg Pro Gly Gly Gly 
450 455 460 

Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
465 470 475 480 

Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg 1 Val 
485 490 495 

Val Gin Arg Glu Lys Arg 
500 



<210> 17 

<211> 511 

<212> PRT 

<213> human immunodeficiency virus 



<400> 17 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Tip Arg Trp Gly Trp Arg 
15 10 15 

Tip Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Trp Val Tin- Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 



15 

35 40 45 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Tlir Glu 
50 55 60 

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 SO 

Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 

Lys Asn Asp Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp 
100 105 110 

Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
115 120 125 

Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
130 135 140 

Gly Arg Met He Met Glu Lys Gly Glu He Lys Asn Cys Ser Phe Asn 
145 " 150 155 160 

He Ser Tin" Ser He Arg Gly Lys Val Gin Lys Glu Tyr Ala Phe Phe 
165 170 175 

Tyr Lys Leu Asp He He Pro lie Asp Asn Asp Thr Thr Ser Tyr Lys 
180 185 190 

Leu Thr Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro Lys Val 
195 200 205 

Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
210 215 220 

He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
225 230 235 240 

Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
245 250 255 

Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val lie 
260 265 270 



16 



Arg Ser Val Asn Phe Thr Asp Asn Ala Lys Thr lie He Val Gin Leu 
275 2S0 285 

Asn Thr Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
290 295 300 

Lys Axg He Arg He Gin Arg Gly Pro Gly Arg Ala Phe Val Thr He 
305 " 310 315 320 

Gly Lys He Gly Asn Met Arg Gin Ala His Cys Asn He Ser Arg Ala 
325 330 335 

Lys Trp Asn Asn Thr Leu Lys Gin He Ala Ser Lys Leu Arg Glu Gin 
340 345 350 

Phe Gly Asn Asn Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp 
355 360 365 

Pro Glu lie Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 
370 375 380 

Cys Asn Ser Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp 
385 390 395 400 

Ser Thr Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu 
405 410 415 

Pro Cys Arg He Lys Gin He He Asn Met Trp Gin Lys Val Gly Lys 
420 425 430 

Ala Met Tyr Ala Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn 
435 440 445 

He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu 
450 455 460 

Ser Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg 
465 470 475 480 

Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val 
485 490 495 

Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg 
500 505 510 



